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Chapter  1. 


INTRODUCTION 


There  exists  an  assortment  of  software  complexity  measures  in  the 
literature,  some  of  them  celebrated.  This  report  describes  the  examination  of 
two  different  complexity  measures.  The  first  one  is  a  source  code  complexity 
measure  constructed  by  Cantone,  Cimitile,  and  Sansone1  in  which  the 
complexity  of  a  program  is  computed  as  a  polynomial.  The  second  one,  also 
computed  as  a  polynomial,  is  a  data  structure  complexity  measure.  This 
measure  was  developed  by  Tsai,  Lopez,  Rodriguez,  and  Volovik2. 

Software  complexity  is  made  up  of  two  parts,  namely,  complexity  of  the 
source  code  and  that  of  the  data  structure.  They  are  called  source  code  complexity, 
and  data  structure  complexity,  respectively.    Discussion  of  software  complexity  has 


Cantone,  Giovanni,  Aniello  Cimitile,  and  Luclo  Sansone,  "Complexity  In  Program  Schemes: 
The  Characteristic  Polynomial",  SIGPLAN  Notices,    vol.  18,  no.  3,  March  1983. 

*Tsai,  W.T.,  M.A.  Lopez,  V.  Rodriguez,  and  D.  Volovik,  "An  Approach  to  Measuring  Data 
Structure  Complexity",  Proceedings  of  IEEE  Computer  Society's  Tenth  International  Computer 
Software  and  Applications  Conference,    1986,  pp.  240-2-56. 
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almost  always  meant  the  complexity  of  the  source  code  and  rarely  referred  to 
the  complexity  associated  with  the  data  structure. 

The  source  code  complexity  measure  that  this  study  concentrates  on  and 
as  developed  by  Cantone  and  others,  is  called  characteristic  polynomial.  Giving 
credit  to  the  developers  and  for  lack  of  better  names,  the  data  structure 
complexity  measure  will  be  called  Tsui's  data  structure  complexity  measure  or  Tsui's 
polynomial 

In  addition  to  a  description  of  the  two  complexity  measures,  the  inquiry 
in  this  study  further  involves: 

1.  investigation  of  the  existence  of  equivalence  classes  and/or  partial  orderings 
related  to  characteristic  polynomial  and  to  Tsai's  data  structure 
complexity, 

2.  development  of  a  mapping  measure  which  maps  the  characteristic 
polynomial  [and  Tsai's  polynomial]  on  to  real  numbers  for 
straightforward  comparison, 

3.  verification  that  the  mapping  measure  preserves  the  partial  order 
of  the  characteristic  polynomial  and  Tsai's  polynomial, 

4.  validation  of  both  the  complexity  measures  utilizing  a  complexity 
measures  validation  paradigm,  and 

5.  discussion  of  the  possibility  of  combining/collapsing  of  characteris- 
tic polynomial  and  Tsai's  polynomial  to  arrive  at  an  attended  software 
complexity  measure. 
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1.1.  ORGANIZATION  OF  THE  REPORT: 

Chapter  2  describes  characteristic  polynomials  as  developed  by  Cantone, 
Cimitile,  and  Sansone  and  describes  its  properties.  Tsai's  polynomials 
constructed  by  Tsai,  Lopez,  Rodriguez,  and  Volovik  and  their  properties  are 
covered  in  Chapter  3.  A  few  mapping  measures  that  map  these  polynomial 
complexity  measures  to  real  numbers  are  introduced  in  Chapter  4.  Investiga- 
tion of  the  existence  of  equivalence  classes  and/or  partial  orders  with  respect 
to  characteristic  polynomials  and  Tsai's  polynomials  are  examined  respectively 
in  Chapters  5  and  6.  A  brief  discussion  of  problems  in  combining  characteris- 
tic polynomials  and  data  structure  complexity  is  given  in  Chapter  7. 

Characteristic  polynomials,  as  defined  by  Cantone,  et.  al.,  are  obtained 
using  directed  graphs  whereas  Tsai's  data  structure  complexity  polynomial 
that  gauges  the  complexity  of  a  program's  data  structure  is  obtained  utilizing 
directed  multigraphs.  Even  though  Cantone  and  others  have  discussed  the 
development  of  characteristic  polynomials  by  basing  them  on  directed  graphs, 
their  measure  is  really  based  on  structured  flowgraphs.  A  discussion, 
therefore,  of  digraphs,  directed  multigraphs,  flowgraphs,  structured  flowgraphs, 
and  other  related  concepts  is  perceived  to  be  useful.  However,  to  avoid  such 
a  discussion  from  cluttering  the  main  aspects  of  the  study,  it  is  presented  in 
the  appendix  [see  Appendix  A].  Additionally,  concepts  associated  with 
relations,  orders,  and  flowgraph  transformations  which  are  used  in  this  study 
are  also  discussed  in  Appendix  A. 
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Appendix  B  describes  a  complexity  measure  validation  paradigm  as 
proposed  by  Baker,  Bieman,  Gustafson,  and  Melton3.    This  paradigm  is  used 
in  validating  the  above  two  complexity  measures. 


Baker,  Albert  L.,  James  M.  Bieman,  David  A.  Gustafson,  and  Austin  C.  Melton,  "Modeling 
and  Measuring  the  Software  Development  Process",  Proceedings  of  Hawaiian  International 
Conference   on   Computers   and  Software,    1987,  pp.  23-30. 


Chapter  2. 

CHARACTERISTIC  POLYNOMIALS 


2.1.  SOURCE  CODE  COMPLEXITY: 

Complexity  measures  have  traditionally  been  constructed  for  programs 
rather  than  for  data  structures.  This  section  describes  a  source  code  com- 
plexity measure  called  characteristic  polynomial. 

Complexity  measures,  most  notably  McCabe's  cyclomatic  number  and 
Halstead's  software  science  measure,  do  not  consider  control  environments, 
cycles,  selections,  and  nesting.  Cantone,  et.  al.,  construct  a  complexity 
measure  that  takes  into  account  the  structural  characteristics  of  the  control 
flow  of  a  program  and  the  number  of  its  possible  executions.  Their  construc- 
tion is  called  the  characteristic  polynomiaC.  A  program's  characteristic  polynomial  is 
easily  constructed  from  its  flowgraph. 
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2.2.  FLOWGRAPHS  versus  CHARACTERISTIC  POLYNOMIALS: 

Flowgraphs  have  been  the  basis  for  several  complexity  measures.  Even 
though  characteristic  polynomials  could  be  constructed  without  utilizing  flow- 
graphs,  it  is  much  easier  to  visualize  the  computation  of  characteristic 
polynomials  with  the  aid  of  flowgraphs.  Furthermore,  characteristic  poly- 
nomials are  constructed  in  this  study  [as  well  as  in  Cantone4  and  others' 
original  work]  only  for  structured  flowgraphs  that  avoid  flowgraph  representa- 
tion for  spaghetti  code.  Structured  flowgraphs/programs  have  become  the 
norm  and  hence,  computing  characteristic  polynomials  for  unstructured 
programs  is  not  attempted. 

Since  the  complexity  of  source  code  is  related  to  the  nesting  of  its 
control  structures,  the  complexity  increases  as  the  level  of  nesting  of  the 
control  structures  of  the  program  increases.  In  constructing  the  characteristic 
polynomial,  therefore,  different  criteria  are  applied  to  obtain  the  various 
components  of  the  characteristic  polynomial. 

2.3.  COMPUTATION  OF  CHARACTERISTIC  NUMBERS  AND 
POLYNOMIALS: 

The  characteristic  polynomial  of  a  structured  flowgraph  is  obtained  by 
combining  what  is  termed  by  Cantone4  et.  al.,  characteristic  numbers  of  its 
subgraphs  and  primitive  nodes.  Understandably,  a  characteristic  number  can 
be  associated  with  each  node  of  a  flowgraph.  The  following  criteria  are  used 
in  computing  the  characteristic  numbers: 


Cantone,    e^.    al..,    op_.    clt . 
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for  each  primitive  node  the  characteristic  number  is  i. 
for  a  structure  of  the  serial  type,  the  characteristic  number  is  given  by 
the  product  of  the  characteristic  numbers  of  the  structures  and  of  the 
primitive  nodes  which  are  immediately  contained  in  it.  This 
means  that  if  a  digraph  is  made  up  of  a  sequence  of  primitive 
nodes  representing  a  program  with  no  looping  and  branching 
statements,  then  its  characteristic  number  is  the  same  as  that 
of  a  single  primitive  node. 

for  a  structure  of  the  selective  type,  the  characteristic  number  is  given 
by  the  sum  of  the  characteristic  numbers  of  the  structures  and  of  the 
nodes  which  are  immediately  contained  in  it.  This  value  coincides 
with  the  number  of  total  paths  from  the  start  node  to  the 
terminal  node  of  the  program  flowgraph. 

for  a  cyclic  structure,  the  characteristic  number  is  given  by  the 
product  of  a  variable  c  and  the  characteristic  number  of  the  unique  structure 
or  primitive  node  immediately  contained  in  it.  Variable  c  is  used 
to  distinguish  the  structural  characteristics  of  the  control  flow  of 
a  program/flowgraph  and  consequently  the  structure  of  the 
characteristic  polynomial  expression. 

the  characteristic  number  of  a  program,  is  the  characteristic  number 
of  the  level  m  graph  associated  with  the  program. 

the  general  form  of  a  characteristic  polynomial  is  la,  c 


2.4.  PROPERTIES  OF  CHARACTERISTIC  POLYNOMIALS: 

Since  the  characteristic  number  of  a  representative  program  is  non- 
linear in  c,  it  is  called  the  characteristic  polynomial  of  a  program.  However, 
note  that  the  characteristic  polynomial  of  every  program  is  not  non-linear  in 
c. 

Briefly,   the   characteristic   polynomial   of  a   program   without   cyclic 

structures  is  of  degree  0  in  variable  c  and  the  value  of  the  characteristic 

polynomial  coincides  with  the  total  number  of  paths  of  the  program.     A 

program  or  subprogram  with  a  cyclic  structure  with  no  other  nested  cyclic 

structure  contained  in  it  is  represented  by  a  polynomial  of  degree   1  in 

variable  c.   The  coefficient  of  the  term  of  the  first  degree  coincides  with  the 

number  of  elementary  cycles  present  in  the  digraph.    Cantone,  et.  al.,  use  the 

phrase  exemplary  e&cutwns  of  a  cyclic  structure  to  denote  any  execution  that  involves 

only  the   execution  of  the  instructions   associated  with  the   nodes  of  one 

elementary  cycle  of  the    corresponding    digraph.     A  complete    characteristic 

a, 
polynomial  denoted  by  la,  .  c      ,  represents  a  program  for  which  there  are  la, 

distinct  exemplary  executions.      In  particular,   a,  are  exemplary  executions 

characterized  by  n  cycles,  a*.,  are  exemplary  executions  characterized  by  n-1 

cycles,  ...,  a„  are  exemplary  executions  without  cycles. 

AN  EXAMPLE: 

Structured  flowgraph  transformations,  viz.,  composition,  alternation,  and 
iteration  transformations,  discussed  in  Appendix  A,  are  incremental  and  progressively  complex. 
in  the  sense  that  if  a  e  A  is  a  structured  flowgraph  [where  A  is  a  set  of 
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flowgraphs]  with  some  measured  complexity  in  any  intended  component  of 
complexity,  then  any  structured  flowgraph  transformations  exercised  on  a  will 
not  reduce  the  complexity  of  o.  Therefore,  instead  of  computing  the  charac- 
teristic polynomials  for  a  sample  collection  of  flowgraphs-which  will  yield  non- 
comparable  characteristic  polynomials~the  computation  is  made  for  flowgraphs 
which  are  the  result  of  successive  employment  of  structured  flowgraph 
transformations  on  a  given  flowgraph.  Note  that,  based  on  an  assumption 
made  in  the  software  complexity  measures  paradigm  [see  Appendix  B],  the  set  A  can  be 
treated  as  a  preordered  set. 

The  application  of  the  structured  flowgraph  transformations  that  can  be 
performed  on  a  structured  flowgraph  will  be  obvious  in  the  following  example. 
Composition,  alternation,  and  iteration  transformations  have  been  used  to 
arrive  at  the  successive  flowgraphs.  The  characteristic  polynomial  is  also 
computed  and  shown  alongside  for  each  flowgraph.  Structured  flowgraph 
transformations  modify  characteristic  polynomials  in  such  a  fashion  that  it 
becomes  difficult  to  compare  them  directly.  Further,  one  natural  objective  in 
software  complexity  measures  research  is  to  map  software  complexity  meas- 
ures to  real  numbers.  Hence,  measures  that  reduce  the  characteristic 
polynomials  to  real  numbers  will  be  adopted.  These  measures  make  it 
possible  to  compare  the  complexity  of  the  flowgraphs  directly. 

Structured  flowgraphs  that  are  increasingly  complex  due  to  successive 
application  of  structured  flowgraph  transformations  are  provided  in  the 
following  pages.    Table  2.1  presents  the  characteristic  polynomials  computed 
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CP=  4c 


Figure  2.1 
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CP=  Sc 


Figure  2.2 
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CP  -  6c2  +  2c 


Figure  2.3 
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CS  =  6t  +  2c  +  1 
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CP  =  6c?  +  4c 


Figure  2.5 
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CP  =  lOt 


Figure  2.6 
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CS  =  4c?  +  &» 
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cs  =  te  +  &» 


C2  =  8c<  +  & 


Figure  2.9 
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Flowgraph 

Kind  of  transformation 
applied  on  the  predecessor 

Characteristic 
Polynomial 

2.1 

Initial  flowgraph 

4c 

2.2 

Alternation 

5c 

2.3 

Iteration 

6c8  +  2c 

2.4 

Alternation 

6c2  +  2c  +  1 

2.5 

Iteration 

6c2  +  4c 

2.6 

» 

10c2 

2.7 

" 

4c3  +  8c2 

2.8 

■I 

8c3  +  6c2 

2.9 

M 

8c4  +  6c3 

Table  2.1.    Transformations  and  Characteristic  Polynomials. 
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for  these  structured  flowgraphs  along  with  the  kind  of  transformation  that 
modified  the  previous  flowgraph. 

Note  that  many  junction  nodes  in  Figure  2.1  through  Figure  2.9  have 
been  deliberately  left  out  since  it  reduces  the  clutter  and  these  junction  nodes 
have  no  effect  of  the  characteristic  polynomials. 

It  is  obvious  that  the  structured  flowgraphs  become  increasingly  complex 
as  composition,  alternation,  and  iteration  transformations  are  performed  on 
them.  Also  apparent  is  the  difficulty  in  directly  comparing  their  characteristic 
polynomials.  For  example,  looking  at  the  characteristic  polynomials  [without 
the  aid  of  the  flowgraph  diagrams]  for  flowgraphs  in  Figure  2.4  and  Figure 
2.5,  it  is  difficult  to  say  whether  the  characteristic  polynomial  of  flowgraph  4 
is  more  complex  than  that  of  flowgraph  5  or  vice  versa. 

Prior  to  coming  up  with  a  mapping  measure1,  let  us  determine  if  characteris- 
tic polynomials  obey  either  of  the  two  desired  orderings  in  discrete  mathe- 
matics, viz.,  equivalence  classes  and  partial  orders.  Since  characteristic 
polynomials  are  computed  from  structured  flowgraphs,  the  job  of  determining 
the  orderings  inherent  in  characteristic  polynomials  boils  down  to  confirming 
the  ordering  inherent  in  structured  flowgraphs. 


A  measure  that  maps  each  characteristic  polynomial  with  a  real  number  in  the  set  ft. 


Chapter  3. 


TSATS  POLYNOMIALS 


3.1.  DATA  STRUCTURE  COMPLEXITY: 

Characteristic  polynomials,  discussed  above,  measure  the  source  code 
complexity  and  in  so  doing  try  to  especially  weigh  the  characteristics  assoc- 
iated with  the  control  structures  of  a  program.  However,  they  do  not  take 
into  account  the  complexity  of  data  structures  which,  unquestionably,  con- 
stitutes a  major  portion  of  a  program's  overall  complexity. 

A  complexity  measure  of  the  data  structure  of  a  program,  as  presented 
here,  is  taken  from  Tsai,  Lopez,  Rodriguez,  and  Volovik".  Hereafter,  this  data 
structure  complexity  will  be  called  Isai's  data  structure  complexity  measure  or  briefly 
Isai's  polynomial.  Both  Tsai's  data  structure  complexity  measure  and  Cantone- 
Cimitile-Sansone's    source    code    complexity    measure    yield    similar-looking 

*Tsal,    et_.    al..    op.    cit . 
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polynomials.  The  similarity  of  these  two  measures  is  one  reason  behind  the 
investigation  of  Tsai's  data  structure  complexity  measure.  Besides  the 
similarity  of  these  polynomials,  the  real  motivation  in  studying  Tsai's  data 
structure  complexity  along  with  characteristic  polynomial  has  to  do  with  the 
desire  to  discover  if  these  two  complexity  measures  could  somehow  be 
combined  in  arriving  at  an  extended  compkigty  mtaswe. 

It  is  a  well  known  fact  that  complexity  measures  research  has  not 
focused  much  attention  on  studying  data  structure  complexity.  Equally  well 
known  is  the  fact  that  software  complexity  is  a  function  of  several  factors  and 
data  structure  is  one  of  them.  Note  additionally  that  the  data  structure  of  a 
software  system  is  available  from  the  beginning  of  software  life  cycle  and 
hence,  it  provides  a  way  to  estimate  and  predict  software  characteristics 
earlier  than  other  measures  based  on  source  code  alone.  Further,  data 
complexity  can  be  thought  of  as  a  harbinger  of  the  software  system  complexity 
since  the  structure  of  a  program  is  known  to  follow  the  structure  of  its  data 
very  closely. 

Method  of  computing  Tsai's  measure  is  discussed  below.  Instead  of 
computing  Tsai's  measure  for  a  single  involved  data  structure,  we  begin  here 
with  a  very  simple  data  definition  and  add  more  data  items  making  the  data 
structure  progressively  complex  as  we  did  in  the  case  of  characteristic 
polynomials.  Tsai's  data  structure  complexity  measure  is  computed  at  each 
stage. 
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3.2.  TSAI'S  DATA  STRUCTURE  COMPLEXITY  MEASURE: 

The  algorithm  for  computing  Tsai's  polynomial  requires  that  the  data 
structure  in  any  programming  language  satisfies  a  data  structure  description 
language.  This  data  structure  description  language  is  quite  general  in  that 
it  is  applicable  to  the  data  structures  of  almost  all  structured  programming 
languages. 
3.2.1.  A  Data  Structure  Description  Language: 

The  data  structure  description  language  as  outlined  by  Tsai,  et.  al., 
assumes  that  any  given  data  structure  ©  is  of  the  form 

<D  ::  fl®,,  ...,  <DJ,  n>  1. 

where  /is  a  data  structure  building  operator,  OVs  are  instances  of  references 
to  the  definitions  of  data  structures  used  to  build  £>,  and  n  is  a  number  of 
these  instances.  Note  that  ©,  can  be  <D  itself,  i.e.,  recursive  (self-referential) 
definitions  are  allowed.  Also  note  that  2Vs  can  be  atomic  data  structures,  i.e., 
the  ones  which  are  predefined  in  a  language  and  do  not  have  an  explicit 
definition  of  the  form  above. 

A  finite  set  Q.of  data  structure  definitions  can  then  be  written  as 

Q,-  fo   I  ©,  "f&p,  ...,  (D/J),       jl  <jn 
where  /'s  are  data  structure  building  operators  and  2>/s  are  instances  of 
references  to  data  structures. 

A  data  structure  definition  for  which  Tsai's  polynomial  is  to  be  con- 
structed is  expected  to  follow  the  data  structure  description  language. 
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3.2.2.  Step  1  ■  Building  a  Directed  Multigraph: 

After  verifying  that  the  data  structure  definition  of  a  program  follows 
the  data  structure  description  language,  a  directed  muttigrapA  is  constructed. 
There  is  one  directed  multigraph  for  each  distinct  data  structure  Q.in  a  given 
program.  To  build  the  multigraph,  create  a  unique  node  £.  for  each  data 
structure  definition  2),  and  a  unique  node  for  each  kind  of  atomic  data 
structure  referenced  by  some  data  structure  definition  2}  in  Q,  For  each  data 
structure  definition  2}  of  the  form  2}  .-:  //2}„  ...,  2}J  create  an  edge  directed  from 
d  to  df  for  each  instance  of  2^  referenced  in  the  definition  of  2},  reflecting  the 
fact  that  2}  is  dependent  on  the  definitions  of  25/s  when  2>/s  are  explicitly 
defined  in  a  or  on  the  atomic  data  structures  when  23/s  are  atomic.  Call  this 
multigraph  g. 

3.2.3.  Step  2  ■  Checking  for  Completeness: 

The  directed  multigraph  as  drawn  is  checked  for  completeness.  A  set  of 
definitions  is  complete  if  and  only  if  there  exists  a  path  from  any  node  cor- 
responding to  a  non-atomic  definition  to  some  node  corresponding  to  an  atomic 
definition.  Note  that  checking  for  completeness  can  be  performed  even  before 
building  the  multigraph.  However,  the  directed  multigraph  visually  facilitates 
the  check. 

3.2.4.  Step  3  -  Eliminating  Auxiliary  Definitions: 

For  each  node  in  the  directed  multigraph,  define  the  degree  of  a  node  as 
degree^  =  in-degreed  +  out-degree  f<0 
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where   in-degreed  is    the    total    number   of  edges   going   from   some   node 
[including  iQ  to  d^,  and  out-degree^  is  the  total  number  of  edges  going  from 
node  d  to  some  node  [including  <fl. 

Auxiliary  nodes  in  a  directed  multigraph  are  those  nodes  of  degree  at 
least  two  which  have  exactly  one  in-edge  [or  exactly  one  out-edge]  where  the 
only  in-edge  is  different  from  every  out-edge  and  no  out-edge  forms  a  loop  of 
length  two  with  the  in-edge,  [or  the  only  out-edge  is  different  from  every  in- 
edge  and  no  in-edge  forms  a  loop  of  length  two  with  the  out-edge].  All  such 
auxiliary  nodes  are  eliminated. 

This  amounts  to  eliminating  a  node  if  there  is  one  in-edge  to  the  node 
and  one  out-edge  from  the  node  with  the  condition  that  the  out-edge  does  not 
point  back  to  node's  parent. 

After  deleting  such  nodes,  replace  each  pair  of  edges  [the  in-edge  and 
the  out-edge  of  the  deleted  node]  by  just  one  edge  with  the  same  source  as 
the  in-edge  and  the  same  destination  as  the  out-edge. 
3.2.5.  Step  4  ■  Splitting  into  Strongly  Connected  Components: 

Two  nodes  d,  and  dj  in  a  directed  multigraph  are  said  to  be  strongly 
connected  if  and  only  if  there  are  paths  in  a  directed  multigraph  from  d,  to  <(  and 
from  dj  to  <4  Any  node  is  considered  to  be  strongly  connected  with  itself  even 
when  there  is  no  path  from  the  node  to  itself.  Define  a  strongly  connected 
component  of  a  directed  multigraph  to  be  a  subgraph  acof  Q,  such  that  each 
pair  of  nodes  in  acis  strongly  connected  via  some  path  in  %  Notice  that  Stis 
mwtimd  in  a  sense  that  no  node  in  Q  which  is  not  in  %_  already  is  strongly 
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connected  to  any  of  the  nodes  in  %    Further,  a  binary  relation  of  being 
strongly  connected  partitions  the  set  of  nodes  of  Q  into  a  set  of  equivalence 
classes. 

3.2.6.  Step  S  •  Deriving  Self-Complexity  Monomials: 

The  self-complexity  monomial  of  a  strongly  connected  component  takes 
into  account  the  number  of  nodes  and  edges  within  the  strongly  connected 
component  weighing  it  by  the  number  of  simple  circular  paths  within  the 
component. 

The  self-complexity  monomial  of  a  strongly  connected  component  S(dJ  is 
computed  using 

SCKl  -  S(dJ  •  &+<£}<!■ 
where  Vis  the  total  number  of  nodes  in  a  component  %  lis  the  total  number 
of  edges  in  the  component  a:  to  which  <£  belongs,  and  L  is  the  total  number  of 
simple  circular  paths  [circular  paths  which  do  not  include  proper  circular  sub- 
paths  in  them]  in  the  component  %  and  c  is  a  variable. 

Each  node  within  a  strongly  connected  component  is  assigned  the  same 
self-complexity  monomial.  This  monomial  S(dJ  is  a  measure  of  the  self- 
complexity  of  any  of  the  nodes  in  the  component  % 

3.2.7.  Step  6  -  Deriving  Data  Structure  Complexity  Polynomials: 
Having  computed  the  self-complexity  monomials  for  each  of  the  strongly 

connected  components,  data  structure  complexity  polynomial,  denoted  C(OQ,  is 
computed  by  aggregating  the  self-complexity  monomials. 
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Data    structure    complexity    polynomial    for    the    strongly    connected 
component  acis  then  defined  as 

cm  •  zicckj}  +  sm 

where  the  sum  is  taken  over  the  set  of  "K.  's  such  that  %.%,%.  and  ac  is 

adjacent  to  ao  i.e.,  there  is  an  edge  connecting  some  node  in  acto  some  node 
in  ai,  and  each  summand  CCK)  is  present  in  the  sum  as  many  times  as  there 
are  different  edges  directly  connecting  acto  %.. 

Complexity  of  a  node  is  defined  as  the  complexity  of  the  unique  strongly 
connected  component  to  which  this  node  belongs.      Tsai's  data   structure 
complexity  measure  of  a  data  structure  is  then  just  a  complexity  of  a  node 
corresponding  to  this  data  structure. 
AN  EXAMPLE: 

The  next  several  pages  contain  data  structure  definitions,  their  cor- 
responding multigraphs,  and  their  self-complexity  monomials  and  data 
structure  complexity  polynomials.  The  first  figure  shows  the  initial  data 
structure  that  we  started  with  and  the  others  display  the  incremental  addition 
to  their  predecessors.  This  was  done  with  the  conviction  that  it  will  indicate 
the  way  in  which  the  complexity  changes  as  the  data  structure  definition 
grows  more  complex.  Moreover,  each  modification  of  the  data  structure  can 
be  described  in  terms  of  the  directed  multigraph  transformations  [Appendix  A; 
Section  A.7]. 

Most  of  the  notations  in  the  figures  have  been  described  above.  In  the 
multigraphs  broken  lines  encircle  each  strongly  connected  component.    In  all 
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Figure  3.1 


DATA  STRUCTURE  AND  TSATS  POLYNOMIALS  FOR  FIGURE  3.1. 


Data  Structure: 

Type 

A  =  RECORD 
il :  integer; 
S  :    array  [1  ..  80]  of  char; 

END; 

Self-Complexity  Monomials: 

integer;  =    5fsubrange;  =    .sfcharj  =    s(k)  =    S(S)  =    1 

Tsai's  Complexity  Polynomials: 

CiintegerJ  =    .SiintegerJ  =    1 

flsubrange)  =    ^subrange;  =    1 

C(char)  =    S(char)  =    1 

C(S;  =    CfsubrangeJ  +    C/charj  +    5|SJ  =    3 

C(A)  =    C/integerj  +    c&)  +    S(k)  =    5 
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Figure  3.2 
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DATA  STRUCTURE  AND  TSATS  POLYNOMIALS  FOR  FIGURE  3.2. 


Data  Structure: 

Type 

A  =  RECORD 

il :    integer; 

S  :    array  [1  . 

.  80]  of  char 

fl:    AD; 

END; 

D  =  RECORD 

fl:    AD; 

rl :    real; 

END; 

Self-Complexity  Monomials: 

^(integer;  =    jfsubrange;  =    5fchar;  =    .sfreal;  =    S(A)  =    s$)  =    1 
S(D)  =    (1+1)  X1    =  2X 

Tsai's  Complexity  Polynomials: 

Ciinteger;  =    Jiintegerj  =    1 

CfsubrangeJ  =    5('subrangej  =    1 

C(char)  =    s(cha.r)  =    1 

Cfrealj  =    iireai;  =    1 

C(S;  =    ^subrange;  +    C(chai)  +    s(S)  =    3 

C(D)  =    qreal;  +    s(D)  =    1  +  2X 

OK)  =    Ciinteger;  +    c{8)  +    C(D)  +  S(A)  =    1  +  3  +  (1+2X)  +1    =    6  +  2X 


32 


Figure  3.3 
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DATA  STRUCTURE  AND  TSATS  POLYNOMIALS  FOR  FIGURE  3.3. 


Data  Structure: 


Type 

A  =  RECORD 


il 

S 

fl 

END; 


integer; 

array  [1  ..  80]  of  char; 
aD. 


D  =  RECORD 


fl 
rl 
£2 
END; 


aD; 

real; 
AG; 


G  =  RECORD 
fl:    AA; 

END; 

Self-Complexity  Monomials: 

5(integer;  =    ^subrange;  =    s(char)  =    5ireal;  =    S(S)  =    1 
S(A)  =  S(D)  =    S<G)  =    (3+4)  X2    =  7X2 

Tsai's  Complexity  Polynomials: 

ClintegerJ  =    Jdntegerj  =    1 
C/subrange,)  =    .Si'subrange;  =    1 
C(char)  =    stchar)  =    1 
Cfreal)  =    sfreal)  =    1 

CIS)  =    tfsubrangej  +    C(char)  +    S(S)  =    3 
CIA)  =    C(D)  =    c/G)  =    integer;  +    c(Sl  +    Cfrealj  +   S(A) 
=    1  +  3  +  1  +  7X2    =    5  +  7X2 
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Figure  3.4 


DATA  STRUCTURE  AND  TSATS  POLYNOMIALS  FOR  FIGURE  3.4. 


Data  Structure: 

Type 

A  =  RECORD 


il 
S 
fl 
£2 
END; 


integer; 

array  [1  ..  80]  of  char; 
aD; 

AL; 


L  =  RECORD 

cl :    char; 
END; 

D  =  RECORD 


fl 

rl 
£2 
END; 


aD; 

real; 
AG; 


G  =  RECORD 

fl:    AA; 
END; 

Note:  The  node  L  can  be  eliminated  as  an  auxiliary  definition  and  the  edges 
leading  to  and  emanating  from  L  can  be  replaced  by  an  edge  from  node 
A  to  node  char.    See  Figure  2.5. 
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DATA  STRUCTURE  AND  TSATS  POLYNOMIALS  FOR  FIGURE  3.5. 


Data  Structure: 

Type 

A  =  RECORD 

il 

integer; 

S 

array  [1  ..  80]  of  char; 

fl 

AD; 

£2 

AL; 

END; 

L  =  RECORD 

cl :    char; 

END; 

D  =  RECORD 

fl 

AD; 

rl 

real; 

f2 

AG; 

END; 

G  =  RECORD 

fl:    AA; 

END; 

Self-Complexity  Monomials: 

5(integer;  =    ifsubrangej  =    s(char)  =    S(real)  =    S(S)  =    1 
Slk)  =  S(D)  =    SlG)  =    (3+4)  X2    =  7X2 

Tsai's  Complexity  Polynomials: 

Ciinteger,/  =    Jiintegerj  =    1 
Cfsubrange;  =    5('subrangeJ  =    1 
Cfchar;  =    .tfcharj  =    1 
Cirealj  =    Sfreal)  =    1 

CIS)  =    C/subrangej  +    CftharJ  +    s0)  =    3 

C(A)  =    C(D)  =    C(G)  =    Ciinteger;  +    C(S)  +    Ctchai)  +    C(renA)  +   S(k) 
=    1  +  3  +  1  +  1  + 7X2    =    6  +  7X2 
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DATA  STRUCTURE  AND  TSATS  POLYNOMIALS  FOR  FIGURE  3.6. 


Data  Structure: 

Type 

A  =  RECORD  L  =  RECORD 


il 
S 
fl 
f2 
END; 


integer;  fl 

array  [1  ..  80]  of  char;             f2 

AD;  cl 

AL;  END; 


AL; 
AL; 
char; 


D  =  RECORD 


fl 

rl 

f2 

END 


AD; 

real; 

AG; 


G  =  RECORD 
fl:    AA; 

END; 

Self-Complexity  Monomials: 

ilinteger;  =    .^subrange;  =    5fchar;  =    Sfreal)  =    S$)  =    1 

S(L)  =    (1+2)X2    =    3X2 

S(k)  =  S(D)  =    StG)  =    (3+4)  X2    =  7X2 

Tsai's  Complexity  Polynomials: 

CiintegerJ  =    sfinteger)  =    1 
C(subrange)  =    ^subrange,)  =    1 
CfeharJ  =    jjchar;  =    1 
C(real;  =    5(realj  =    1 

C(S)  =    C/subrangej  +    OcharJ  +    S@)  =    3 
C(L)  =    C(chai)  +    S(L)  =    1+3X2 

C(k)  =    C(D)  =    C(G)  =    Clinteger;  +    CIS)  +    flL)  +    Cfrealj  +   s(k) 
=    1  +  3  +  (1+3X2)  +  1  +  7X2    =    6  +  10X2 
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Figure  3.7 
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DATA  STRUCTURE  AND  TSAI'S  POLYNOMIALS  FOR  FIGURE  3.7. 


ata  Structure: 

Type 

A  =  RECORD 

L  =  RECORD 

il 

integer; 

fl:    AL; 

S 

array  [1  ..  80]  of  char; 

f2:    AL; 

fl 

AD; 

cl :    char; 

Q 

AL; 

END; 

f3 

AL; 

END; 

D  =  RECORD 

fl 

AD; 

rl 

real; 

f2 

AG; 

END; 

G  =  RECORD 

fl:    AA; 

END 

Self-Complexity  Monomials: 


S(char)  =    S(real)  =    SIS) 


Slinteger)  =    .s/subrangej 

SlL)  =    (1+2)X2    =    3X2 

S(k)  =  SlD)  =    SlQ)  =    (3+4)  X2    =  7X2 

Tsai's  Complexity  Polynomials: 

ClintegerJ  =    5(integerj  =    1 
Cl&ubrange)  =    .^subrange)  =    1 
C(char)  =    Slchar)  =    1 
Orealj  =    jfreal;  =    1 

C(&)  =    C/subrange;  +    Cfchar;  +    S(S)  =    3 
C(L)  =    Qcharj  +    SlL)  =    1+3X2 
C(A)  =    C(D)  =    C(G)  =    Cfinteger;  +    C(S)  + 
=    1  +  3  +  2  *  (1+3X2)  +  1  +  7X2    =    7 


2  *  cil; 

+  13X2 


+    CirealJ  +  5(AJ 
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Figure  3.8 
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DATA  STRUCTURE  AND  TSATS  POLYNOMIALS  FOR  FIGURE  3.8. 


Data  Structure: 

Type 

A  =  RECORD 

L  =  RECORD 

il 

integer; 

fl:    AL; 

S 

array  [1  ..  80]  of  char;             f2  :    AL; 

fl 

AD; 

cl :    char; 

Q 

AL; 

END; 

£3 

AL; 

f4 

AM; 

N  =  RECORD 

END; 

fl:    AM; 

p  :    set  of  char 

D  =  RECORD 

END; 

fl 

aD. 

rl 

real; 

G  =  RECORD 

f2 

AG; 

fl:    AA; 

END- 

END; 

Self-Complexity  Monomials: 

5(integer/i  =    ^subrange,)  =    S(char)  = 

S(L)  =    (1+2)X2    =    3X2 

SW  =    S(N)  =    (2+2)X    =    4X 

StA)  =   S(D)  =    5(G;  =    (3+4)  X2    =  7X2 

Tsai's  Complexity  Polynomials: 


flinteger)  =    5(integerJ  =    1 

Cl&ubrange)  =    5(subrange,/  =    1 

C(char)  =    S(char)  =    1 

CirealJ  =    Sfreal)  =    1 

C(S;  =    ^subrange;  +    Cieharj  +    5(Sj  =    3 

C(L)  =    qcharj  +    s(L)  =    1+3X2 

CP)  =    SP)  =    1 

C(M)  =    CfNJ  =    CP)  +  S/M)  =    1+4X 

C(k)  =    C(D)  =    c/g; 

=    Cfintegerj  +  C(Sj  +  CM)  +  2  *  tfL;  +  5<AJ 
=    1  +  3  +  (1+4X)  +  2  *  (1+3X2)  +  7X2    = 


Streed)  =    5(S;  =    SP)  =    1 


7  +  4X  +  13X2 
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the   following   diagrams,    CW  gives   the   data   structure   complexity   of  the 
component  that  contains  the  node  A  [also  nodes  ©  and  Q]  as  well  as  the  data 
structure  complexity  of  the  entire  data  structure  definition. 

Note  that  in  the  multigraph  in  Figure  3.4,  node  L  will  be  removed  in 
the  process  of  deleting  auxiliary  definitions,  and  replaced  by  an  edge  from 
node  A  to  node  "Char".    Consequently,  CW  yields  6+7x2  rather  than  7+7x2. 
3.3.  PROPERTIES  OF  TSAI'S  POLYNOMIALS: 

This  measure  quantifies  the  structural  complexity  of  data,  emphasizing 
the  dynamic  part  of  a  data  structure  over  the  static  part.  The  number  of 
data  elements  in  the  data  structure  has  only  a  small  influence  on  the 
complexity  measure.    It  measures  the  structure  of  data  and  not  the  size  of  it. 

The  measure,  in  the  words  of  Tsai,  et.  al.,  is  intuitive.  Every  reference 
of  one  definition  to  another  definition  (represented  by  an  edge  between  two 
nodes),  and  every  instance  of  such  definitions  (represented  by  a  node) 
increases  the  measured  complexity  of  data  by  increasing  the  coefficients  of  the 
resulting  polynomials.  Circular  references  contribute  much  more  to  the 
measure  by  also  affecting  the  order  of  magnitude  of  the  complexity  measure. 

This  measure  is  consistent,  that  is,  if  data  structure  a:  is  a  substructure 
of  a  data  structure  y,  then  complexity^  £  complexity^.  The  measure  tolerates 
incomplete  information  since  it  is  possible  to  compute  the  measure  at  the  very 
start  of  software  design  when  not  all  the  decisions  have  been  made.  Addition- 
ally, this  measure  is  insensitive  to  language-specific  details  to  some  degree. 


Chapter  4. 


A  MAPPING  MEASURE 


4.1.  A  PROCESS  OF  ABSTRACTION: 

It  is  well  known  that  most  complexity  measures  attempt  to  arrive  at  a 
real  number  that  represents  the  complexity  of  a  program  in  whatever  sense 
they  are  measured.  McCabe's  and  Halstead's  measures  are  two  prominent 
examples.  In  studying  complexity,  an  abstraction  that  generalizes  programs 
is  devised  as  the  first  step.  Complexity  measures  may  be  computed  on  the 
basis  of  such  an  abstraction.  For  example,  the  abstraction  on  which  McCabe 
bases  his  cyclomatic  number  is  none  other  than  the  familiar  flowgraphs. 
However,  the  interest  is  in  producing  measures  that  either  map  the  abstrac- 
tion directly  or  indirectly  onto  real  numbers.  Nevertheless,  such  mapped  real 
numbers  do  not  carry  much  significance  unless  they  obey  some  kind  of 
mathematical  ordering  such  as  preorder,   partial  order,   and  Cintar  order.    Figure  4.1 
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depicts  one  desired  mapping  of  the  abstraction  onto  real  numbers  wherein  the 
mapping  mechanism  preserves  partial  order. 

Orderings  of  measures  make  them  comparaBte.  In  case  of  linear  orders, 
every  pair  of  elements  is  comparable.  However,  linear  orders  are  too 
restrictive  to  be  of  much  value.  Partial  orders  facilitate  comparison  of  two 
elements.  However,  not  every  pair  of  elements  of  a  partially  ordered  set  is 
comparable.  Preordering  is  the  least  restrictive  of  the  three  requiring  the 
relation  to  be  reflexive  and  transitive. 

We  started  with  programs  that  contained  no  statements  that  may  have 
caused  them  to  be  unstructured.  Secondly,  we  took  the  usual  route  in 
abstracting  programs  into  flowgraphs.  Since  the  programs  are  structured  the 
flowgraphs  are  also  structured.  We  will  show  later  that  the  set  of  structured 
flowgraphs  and  a  relation  f  is  a  partial  order  [see  Chapter  5].     However, 

characteristic  polynomials,  which  are  computed  for  structured  flowgraphs,  are 
complex  polynomials  and  are  not  members  of  the  set  CI.  As  noted  above,  this 
causes  difficulties  in  directly  comparing  characteristic  polynomials.  Therefore, 
in  this  study  we  will  extend  the  manner  in  which  an  abstraction  of  a  program 
is  mapped  to  real  numbers  by  incorporating  a  mapping  measure  [Figure  4.2]. 
EXAMPLE  CONTINUED: 

The  characteristic  polynomials  computed  for  the  illustrative  flowgraphs 
in  Chapter  2  were  found  to  be  difficult  to  compare  directly.  In  order  to 
enable  comparisons  between  characteristic  polynomials,  a  few  simple  measures 
were   devised   that   map   the   characteristic   polynomial   expressions   to   real 
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numbers.    It  will  be  shown  later  that  only  one  of  the  four  simple  mapping 
measures  [5W5K]  constructed  in  this  study  preserve  the  ordering. 

The  four  measures  provided  in  Table  4.1  map  the  polynomials  to  real 
numbers.  Furnishing  explanations  of  the  various  measures  seems  to  result  in 
reasonable,  albeit  crude,  descriptions. 

la,       as  was  indicated  earlier,  the  first  measure,  namely,  la,  is  the 

number  of  exemplary  executions.   So,  is  nothing  but  the  total number 

of  paths  [from  the  start  node  to  the  terminal  node]  of  the  digraph. 

Sctj      variable    c  is    significant   only   when   there   are    cyclic    control 

structures  in  a  flowgraph.      Hence,  variable  c  along  with  its 

exponent  indicates  the  nestedness  of  a  flowgraph.    Perhaps,  this 

sum  could  be  treated  as  the  crude  sum  of  nestedness.   Note  that  very 

many  different  polynomials  can  produce  this  real  number  and, 

therefore,  it  is  a  rather  poor  mapping  measure. 

lo/aj  It  is  simple  to  perceive  that  perhaps  la*  must  be  considered  in 

tandem  with  Za,  as  a  mapping  measure.     This  measure  is  the 

number  of  paths  weighted  by  its  nestedness. 

Za,(a.i+ 1)  Nearly  the  same  as  the  measure  above  with  the  exception  that 

the  exponent  of  the  variable  is  incremented  by  unity  before  using 

it  as  a  weight  to  the  number  of  paths. 

As  noted  earlier,  the  flowgraphs  illustrated  in  Chapter  2  were  generated 

utilizing  the  flowgraph  transformations  and,  clearly,  they  were  increasingly 

complex.   It  is  not  clear,  however,  whether  the  characteristic  polynomials  built 
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Characteristic 
Polynomial 

(1) 
la, 

(2) 

(3) 
EaiCa,) 

(4) 
Sa,(o,+l) 

1.  4c 

4 

1 

4 

8 

2.  5c 

5 

1 

5 

10 

3.  6c2  +  2c 

8 

3 

14 

20 

4.  6c2  +  2c  +  1 

9 

3 

14 

23 

5.  6c2  +  4c 

10 

3 

16 

26 

6.  10c2 

10 

2 

20 

30 

7.  4c3  +  8c2 

12 

5 

28 

40 

8.  8c3  +  6c2 

14 

5 

36 

50 

9.  8c4  +  6c3 

14 

7 

50 

64 

Table  4.1.  Mapping  Measures  for  Characteristic  Polynomials. 
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for  the  increasingly  complex  flowgraphs  demonstrate  any  such  gain  in  source 
code  complexity.  That  is,  it  is  not  always  easy  to  compare  two  characteristic 
polynomials-without  the  aid  of  related  flowgraphs-generated  for  transformed 
flowgraphs  and  remark  if  one  characteristic  polynomial  is  more  complex  than 
the  other. 

It  is  for  this  reason  why  an  attempt  was  made  to  find  measures  that 
would  map  characteristic  polynomials  to  real  numbers  and  hence  facilitate 
comparison.  The  anticipation  is  that  any  such  measure  would  map  increas- 
ingly complex  characteristic  polynomials,  due  to  increasingly  complex  flow- 
graphs,  onto  increasingly  larger  real  numbers.  It  is  important  to  establish 
that  the  mapping  also  preserves  partial  order  if  any  comparison  of  the  resulting  real 
numbers  is  desired.  Indeed,  one  of  the  goals  of  the  software  complexity 
measures  research  is  to  find  functions  [measures]  that  preserve  these  orders. 
Therefore,  it  would  be  gratifying  to  see  that  more  complex  flowgraphs  are 
always  mapped  with  correspondingly  larger  real  numbers  and,  at  the  same 
time,  the  mapping  measures  preserve  partial  orders. 

Table  4.2  presents  a  comparative  picture  of  the  characteristic  polynom- 
ials computed  for  the  increasingly  complex  structured  flowgraphs  [given  in 
Figure  2.1  through  Figure  2.9]  in  terms  of  the  mapping  measures  [furnished 
in  Table  4.1].  Note  that  it  is  useless  to  compare  mapping  measures  (1)  and 
(2)  on  the  polynomials.  Instead,  they  are  combined  together  to  check  how 
they  both  map  the  polynomial  expressions7. 


This  comparison  could  in  turn  be  performed  in  terms  of  vectors.  For  example,  comparing 
characteristic  polynomials  1  and  2  can  be  written  as  [2,1]  <  [3,1]  instead  of  2  <  3  &  1  = 
1,  etc. 
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Characteristic 
Polynomials 

COMPARISONS 

(1)  &  (2) 
Za,  £  lb, 

& 
Zcq  <  IB, 

(3) 
ZaA 

<. 

1 

(4) 
lato+l) 

<, 
Ib,(B,+l) 

1  &  2.  4c  vs.  5c 

2  &  3.  5c  vs.  6c2+2c 

3  &  4.  6c2+2c  vs.  6c2+2c+l 

4  &  5.  6c2+2c+l  vs.  6(^+40 

5  &  6.  6c2+4c  vs.  10c2 

6  &  7.  10c2  vs.  4c3  +  8c2 

7  &  8.  4c3+8c2  vs.  Sd'+6c' 

8  &  9.  8c3+6c2  vs.  8c'+6c3 

4  <    5  &  1  =  1 

5  <    8  &  1  <  3 

8  <    9  &  3  =  3 

9  <  10  &  3  =  3 
10  =  10  &  3  £  2 
10  <  12  &  2  £  5 
12  <  14  &  5  =  5 
14  =  14  &  5  <  7 

4  <    5 

5  <  14 
14  =  14 
14  <  16 
16  <  20 
20  <  28 
28  <  36 
36  <  50 

8  <  10 
10  <  20 
20  <  23 
23  <  26 
26  <  30 
30  <  40 
40  <  50 
50  <  64 

[a,  and  a,  refer  to  the  coefficient  i  and  exponent  t  of  the  first  characteristic 
polynomial  and  ft  and  fi  refer  to  the  coefficient  i  and  exponent  i  of  the  second 
characteristic  polynomial  of  each  comparison.] 

Table  4.2.    Comparisons  of  Characteristic  Polynomials. 
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Overlooking  preservation  of  partial  orders  for  the  time  being,  it  is 
apparent  that  the  combined  mapping  measure  of  (1)  and  (2)  as  well  as  (3)  do 
not  consistently  produce  larger  real  numbers  for  more  complex  characteristic 
polynomials.  [Compare  the  characteristic  polynomials  5  and  6  as  well  as  8 
and  9  with  respect  to  the  combined  measure  (1)  &  (2)  and  the  polynomials  3 
and  4  with  respect  to  the  mapping  measure  (3).]  Mapping  measure  (4),  on 
the  other  hand  seems  to  achieve  the  mapping  properly.  The  fact  that  this 
mapping  measure  does  map  characteristic  polynomials  of  more  complex 
flowgraphs  onto  larger  real  numbers  will  be  mathematically  established  later. 
Additionally,  it  preserves  the  partial  order  as  well,  as  will  be  established 
below. 


Chapter  5. 

EQUIVALENCE  CLASSES, 

PARTIAL  ORDERS, 

AND 

CHARACTERISTIC  POLYNOMIALS 


'Dcjimtian.  S.i:  Let  A  be  a  set  of  structured  fiowgraphs.    For  any  two  structured 
flowgraphs,  Q,  e  A  and  g,  e  H  there  exists  a  relation  §,  tf  Qv  if  £ 

can  be  obtained  from  £,  by  a  finite,  £>  0,  sequence  of  applications 
of  structured  flowgraph  transformations,  viz.,  composition, 
alternation,  and  iteration  transformations. 
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5.1.  STRUCTURED  FLOWGRAPH  TRANSFORMATIONS  AND 
EQUIVALENCE  CLASSES: 

Given  characteristic  polynomials,  the  consideration  is  regarding  the 
existence  of  any  equivalence  classes  and/or  partial  orders  related  to  charac- 
teristic polynomials.  The  outcome  of  structured  flowgraph  transformations  on 
equivalence  classes  is  examined  below. 

Starting  with  a  structured  flowgraph  and  applying  the  composition 
transformation,  it  is  easy  to  find  that  the  transformed  flowgraph  does  not 
belong  in  the  same  equivalence  class.  Composition  destroys  the  symmetry 
property.  Given  two  structured  flowgraphs  g,  =  (%,'E1,s„tJ  and  gz  =  (^"E^s^tJ, 
composition  results  in  a  new  structured  flowgraph,  g,  =  (%  11  J^£,  11  'E2,s1,tJ 
with  the  restriction  that  the  exit  node  of  g,  and  the  entry  node  of  g,  become 
a  single  new  junction  node.  Although  both  Q,  and  g,  comply  with  reflexive 
and  transitive  properties,  they  do  not  satisfy  the  property  of  symmetry  since 
while  an  edge  such  as  {svk)  e  gv  where  k.  e  "H*  (*vK)  t  Qt-  We  can,  therefore, 
conclude  that  flowgraph  transformations  do  not  result  in  preserving  equiv- 
alence ordering.    In  other  words,  the  relation  f  as  defined  above  is  not  an 

equivalence  ordering. 

5.2.  STRUCTURED  FLOWGRAPH  TRANSFORMATIONS  AND  PARTIAL 
ORDERS: 

Though  the  relation  jFis  not  an  equivalence  ordering,  it  will  be  shown 

below  that  it  is  a  partial  ordering.    Proving  f  is,  a  partial  ordering  involves 
showing  that  reflexivity,  antisymmetry,  and  transitivity  properties  are  still 
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met  after  any  of  the  three  possible  transformations.  Assume  three  structured 
flowgraphs  g,  =  (!>(1,'EI,s„tJ,  Q,  =  (Hi/E^s^u),  and  Q,  =  (%£„.(„ tj  where  g,  is  combined 
with  Qi  using  composition,  alternation,  or  iteration  transformations  resulting 
in  a  new  structured  flowgraph,  gllt  and  Q2  is  combined  with  g,  producing  a 
flowgraph,  gu.  Employing  similar  notations,  combining  gv  with  gu  will  be 
written  giM. 

Clearly  g,  J  g,  and  g2  f  g,  immaterial  of  the  application  of  composition, 

alternation,  or  iteration  transformation.    Therefore,  reflexivity  property  holds. 

With  any  of  the  three  structured  flowgraph  transformations,  unless 

when  t=  0,  gt  7  §u  but  gu  f  g,.  When  i=  0,  g,  7  gu  and  gu  f  g,  implying 

that  g,  =  gu.  Therefore,  relation  fFis  antisymmetric. 

For  observing  transitivity  property  assume  that  the  structured  flowgraph 
g,  is  combined  using  any  of  the  structured  flowgraph  transformations  with  g, 
producing  gu.  Further  assume  that  the  structured  flowgraphs  gu  and  g,  are 
combined  producing  a  new  flowgraph  gUJ.  We  have  argued  above  that  g,  7  gu 

and  §u  f  gaj  for  any  finite  sequence  of  structured  flowgraph  transformations. 

It  is  easy  to  see  that  if  g,  can  be  transformed  into  §u  using  g,  in  some  finite 
sequence  of  transformations  and  gu  can  similarly  be  transformed  into  gai  using 
g,  in  some  finite  applications  of  transformations,  then  gt  can  be  transformed 
into  gaj  using  g2  and  g,  in  some  finite  transformations.    Therefore,  g,  J  g!2J. 
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This  implies  that  if  g,  J  gu  and  ga  $  Qu*  then  g,  f  gaj  and  hence  f  is 

transitive. 

Since  f  is  reflexive,  antisymmetric,  and  transitive  it  is  a  partial  order. 

In  other  words,  structured  flowgraph  transformations  preserve  the  partial 
order  of  the  relation  f.     Summarizing  the  results  one  can  arrive  at  the 

following  definition: 

(Definition  52:  The  relation  J  on  a  set  of  structured  flowgraphs  A  is  a  partial 

order  since  jFis  reflexive,  antisymmetric,  and  transitive.   In  other 

words,  (M,!F)  is  a  poset. 

5.3  PROOF  THAT  CHARACTERISTIC  POLYNOMIALS  PRESERVE 
PARTIAL  ORDER: 

(Definition  S3:  Assume  a  set  of  structured  flowgraphs  A  that  were  generated 
using  structured  flowgraph  transformations.  Further,  let  C£  be 
the  set  of  characteristic  polynomials  computed  for  it  Then,  the 
relation  $  on  CB  is  a  partial  ordering  and  IC2,J)  is  a  posit. 

In  order  to  prove  that  the  characteristic  polynomials  preserve  the 
ordering  we  would  show  that  the  characteristic  polynomial  of  a  flowgraph  is 
contained  in  the  characteristic  polynomial  of  the  transformed  flowgraph.  Let 
CS.  be  the  characteristic  polynomial  of  a  flowgraph  [the  characteristic  poly- 
nomial of  the  transformed  flowgraph  is  denoted  CSl.    Then,  it  can  be  written 
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a,  a,  ctt  a. 

CB   -   a   c        +   a   c        +    ...  +   a   c        +    ...        +   a   c  (1) 

o        1  2  £  it 


Lac  where  both  o  £  0  and  a  >  0  and 

i=i  i 

are  both  integers. 

COMPOSITION  TRANSFORMATION: 

By  applying  a  composition  transformation,  a  composite  structure  is 

obtained  by  sequentially  combining  two  structured  flowgraphs.   The  exit  node 

of  one  structured  flowgraph  is  made  to  coincide  with  the  entry  node  of  another 

structured  flowgraph.  Each  of  these  structured  flowgraphs  has  a  characteristic 

polynomial  of  its  own.    The  computation  of  characteristic  polynomial  measure 

dictates  that  the  characteristic  polynomial  of  the  transformed  flowgraph  will 

be  the  product  of  the  characteristic  polynomials  of  the  two  separate  structured 

flowgraphs.    If  the  characteristic  polynomials  of  the  two  individual  structured 

flowgraphs  are  assumed  to  be 

n        a, 
CS  =       "Lac  and  a) 

1  W  i  '  ' 


CS  =       lit  where  a  >  0,  6  i  0,  a  >  0,  and  /  >  0  (3) 

J~    J 

and  are  all  integers. 

then  the  characteristic  polynomial  of  the  composed  flowgraph  is: 

CS  -  /  %  a  c        1    .    [  %  i  Z1     J.  (4) 

t  i=l  i  j=i  j 
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which  is  larger  than  (l).  Hence,  CS,  <c  C2„  that  is,  the  characteristic  polynomial 
of  the  initial  flowgraph  is  contained  in  the  characteristic  polynomial  of  the 
transformed  flowgraph. 
ALTERNATION  TRANSFORMATION: 

Alternation  transformation  lets  us  combine  two  structured  flowgraphs 
by  permitting  them  to  be  successors  of  a  common  condition  node  and  also 
letting  them  have  a  common  exit  node.  Once  again,  assume  that  the  two 
flowgraphs  have  the  characteristic  polynomials  indicated  in  (z)  and  (3)  respec- 
tively. Then,  using  the  fact  that  the  characteristic  polynomial  is  given  by  the 
sum  of  the  characteristic  numbers  of  the  structures  and  nodes  which  are 
contained  in  it,  the  new  characteristic  polynomial  of  the  transformed  flowgraph 
is  given  by: 

ft  o,         i     ,      I 

CS   =  £     a      c      +     I      6   c  '  IS) 

t      i=l      i  j=l     j 

which  implies  that  CS,  £,  CS,. 

ITERATION  TRANSFORMATION: 

The  transformed  structured  flowgraph  is  organized  such  that  an  original 

structured  flowgraph  is  placed  within  a  Coop  with  a  condition  node  at  its 

beginning  or  end  to  govern  the  loop's  termination.      Assuming  that  the 

characteristic  polynomial  of  the  initial  flowgraph  is  given  by   (Z),  the  new 

characteristic  polynomial  is  given  by  either: 

n  a, 

OS  -  /  X     ac     1    '  c  (6) 

t         i=i     i 

if  it  is  a  repeat-until  structure,  or  by: 
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n  a, 

d    •   I    2      a    c       +      1    1     '    c  (7) 

t  i=l      i 

if  it  is  a  while-do  structure. 

This  indicates  that  C£.  <,,  CB,  meaning  that  the  characteristic  polynomial 
of  the  initial  flowgraph  is  contained  in  the  new  characteristic  polynomial 
following  an  iteration  transformation. 
5.4  PROOF  THAT  lafa+i)  PRESERVES  THE  PARTIAL  ORDER: 

In  order  to  provide  a  proof  that  Zoila.,+1)  preserves  the  order,  we  will 
borrow  the  results  from  the  previous  section.  Showing  that  the  mapping  of 
the  characteristic  polynomial  to  real  numbers  using  the  mapping  measure 
Znta.i+1)  preserves  partial  order  is,  again,  a  simple  mathematical  exercise.  This 
exercise  is  also  carried  out  in  terms  of  composition,  alternation,  and  iteration 
transformations. 

As  we  know,  the  generalized  characteristic  polynomial  for  the  original 
structured  flowgraph  is  given  by: 

S  a< 

CS   =   Z     a   c  (2) 

and  that  of  another  structured  flowgraph  with  which  the  original  flowgraph 
is  composed  is  given  by: 

m  fi 

OS  -  .J     b.  c  where  a  £  0,  b  >  0,  a  >  0,  and  /  >  0  (3) 

and  are  all  integers. 

and  the  mapping  measure  of  the  original  flowgraph  is  given  by: 

n 

MM=  i     a  (a  +i)  (g) 
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Following   the   composition   of  these   two   structured   flowgraphs,   the   new 

characteristic  polynomial  becomes  (4),  which  is: 

a  a,  m         s, 

<&   -  /      1      a    c    1    .     I      16    c'\.  (4) 

t  »=I      i  j=l  j 

This  characteristic  polynomial  (4)  can  be  shown  to  have  the  following  mapping 

measure: 

n  m  ■  . 

<MM=  1     a   .      £     b    (a+f  +  1)  19) 

i=l      i        j=l      j       i    j 

which    is   larger    than    the    mapping   measure    for    the    original    structured 

flowgraph  as  indicated  in  (8). 

In    case    of   an    alternation    transformation,    the    new    characteristic 

polynomial  is  given  by  IS)  and  its  new  mapping  measure  is  given  by: 

n  m 

MM=     Z     a    (a  +1)    +      I     6(8   +1)  (10) 

(=1      i      i  j=l     j      j 

which  is  larger  than  (8)  by  its  second  term. 

Iteration  transformation  also  raises  the  size  of  the  mapping  measure. 
Note  that  an  iteration  transformation  alters  the  characteristic  polynomial  to 
either  (6)  or  (7).  The  mapping  measure  corresponding  to  these  new  characteris- 
tic polynomial  expressions  are  given  by  (11) and  irrespectively: 

MM=     £     a   (a  +2)  iU) 

i=l      i      i  '    ' 

n 
MM=     I.     a   (a  +2)    +  2  (12) 

each  of  which  is  larger  than  the  mapping  measure  of  the  original  structured 
flowgraph  (8).  Therefore,  any  increase  in  complexity  of  the  original  flowgraph 
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due  to  composition,  alternation,  and/or  iteration  transformations  results  in  a 
larger  real  value  and  hence  proves  that  the  mapping  measure  (8)  does  preserve 
partial  order. 
5.5  SYNOPSIS: 

We  noted  that  if  a  structured  flowgraph  g2  can  be  obtained  from 
structured  flowgraph  g,  using  structured  flowgraph  transformations,  then  the 
set  of  structured  flowgraphs  and  the  relation  7  [see  the  start  of  Chapter  5  for 

a  definition  of  J\  is  a  partial  order. 

Characteristic  polynomials  were  then  constructed  for  structured 
flowgraphs  and  were  shown  to  preserve  the  order.  Since  the  polynomials  did 
not  lend  themselves  to  direct  comparison,  a  mapping  measure  (MM)  was 
devised.  As  we  have  shown  above,  the  mapping  measure,  svfiW,  preserved  the 
partial  order  while  facilitating  direct  comparison. 

It  must  be  noted  that  MM(gj  is  comparable  to  MfMigj  or  that  CB,  is 
comparable  to  CS,  [assuming  that  CP,  and  CP2  are  the  characteristic  polynomials 
constructed  for  the  structured  flowgraphs  Q,  and  gu  respectively]  if  and  only 
if  the  structured  flowgraph  g2  is  constructed  by  applying  structured  flowgraph 
transformations  on  the  structured  flowgraph  g,.  For  two  unrelated  structured 
flowgraphs,  Q,  and  gu  it  is  not  possible  to  say  if  CP,  and  CS2  are  comparable. 

In  order  to  validate  characteristic  polynomials  utilizing  the  software 
complexity  measure  validation  paradigm  given  in  Appendix  B,  let  m  be  a 
[hypothetical]  software  complexity  measure  which  is  strictly  monotone  with 
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respect  to  the  f  partial  order.    Then  if  o  and  6  are  programs  with  structured 

flowgraphs  Q,  and  Q„  respectively,  and  m(a)  2  Mi),  then  we  cannot  have  that 
MM(QJ  >  MMlQj.  That  is,  characteristic  polynomials,  as  a  complexity  measure, 
is  strictly  monotone  with  respect  to  the  JF  preordering  of  all  structured 
flowgraphs. 


Chapter  6 

EQUIVALENCE  CLASSES, 

PARTIAL  ORDERS, 

AND 

TSATS  POLYNOMIALS 


As  was  with  characteristic  polynomials,  the  usefulness  of  the  data 
structure  complexity  polynomial  depends  on  whether  it  preserves  any  mathe- 
matical ordering.  In  order  to  check  the  existence  of  equivalence  classes  and 
partial  orders  in  Tsai's  polynomials,  we  can  once  again  begin  with  the 
definition  of  a  relation  [see  Definition  6.1  below]  similar  to  the  one  employed 
for  flowgraphs  in  Chapter  5.  Note  that  Definition  6.1  is  expressed  in  terms 
of  multigraph  transformations  discussed  in  Appendix  A.    It  may  be  recalled 
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that  Definition  5.1  was  stated  in  terms  of  structured  flowgraph  transforma- 
tions. 

•Definition  6.1:  Let  J?  be  a  set  of  directed  multigraphs.   For  any  two  multigraphs, 
M,  e  A  and  M,  e  a  there  exist  a  relation  SW,  Q  <MV  if  rW,  can  be 

obtained  from  M,  by  a  finite,  ^  0,  sequence  of  applications  of 
directed  multigraph  transformations,  viz.,  node  addition,  node 
replication,  and  edge  addition  transformations. 
6.1  MULTIGRAPH  TRANSFORMATIONS  AND  EQUIVALENCE 
CLASSES: 

Node  addition  transformations  are  possible  only  from  non-atomic  nodes. 
The  added  node  may  be  either  atomic  or  non-atomic.  However,  in  order  to  be 
complete,  if  the  added  node  is  non-atomic  path(s)  must  be  provided  from  the 
new  non-atomic  node  to  atomic  node(s).  In  any  case,  a  node  addition 
transformation  modifies  the  directed  multigraph  and  the  new  multigraph  no 
longer  belongs  in  the  same  equivalence  class. 

A  node  replication  transformation  applied  on  a  multigraph  generates  a 
directed  multigraph  with  a  new  awdOwy  definition  contained  in  it.  As  noted 
from  the  method  of  computing  Tsai's  polynomial  we  know  that  any  such 
node(s)  do  not  contribute  to  the  data  structure  complexity  and  can  effectively 
be  removed.  Hence,  the  original  directed  multigraph  and  the  new  multigraph 
with  one  or  more  node  replication  transformations  applied  on  it  are  equivalent. 
Therefore,  node  replication  transformation  retains  the  directed  multigraph  in 
the  same  equivalence  class. 
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The  effect  of  edge  addition  transformation  on  directed  multigraphs  is 
more  complex.  Since  a  path  from  an  atomic  definition  to  a  non-atomic 
definition  will  have  no  correspondence  to  a  possible  data  structure  definition, 
edge  additions  will  be  restricted  between  non-atomic  nodes  or  from  non-atomic 
nodes  to  atomic  nodes. 

Edge  addition  transformations  can  be  distinguished  into  forward  edge 
addition  transformation  and  Backward  edge  addition  transformation.  In  case  of  forward  edge 
additions,  an  edge  which  originates  in  a  non-atomic  node  points  at  an  atomic 
node  or  the  edge  originates  in  a  node  which  must  have  been  declared  before 
the  node  which  the  edge  is  pointing  to.  On  the  other  hand,  backward  edge 
addition  transformation  is  one  where  the  edge  originates  at  a  node  whose 
declaration  was  made  later  than  the  node  it  points  at.  In  essence,  backward 
edge  addition  transformation  causes  a  circular  definition  of  data.  Note  that 
there  can  not  be  a  backward  edge  addition  transformation  from  an  atomic 
node  to  any  other  node. 

Edge  additions  between  non-atomic  nodes  may  be  either  forward  edge 
addition  transformation  or  backward  edge  addition  transformation  whereas 
edge  addition  transformation  from  a  non-atomic  node  to  an  atomic  node  is 
always  forward  edge  addition.  Forward  edge  addition  does  not  change  the 
number  of  strongly  connected  components  but  it  increases  references  [non- 
cyclic]  to  other  data  structures.  In  the  case  of  backward  edge  addition,  new 
cyclic  references  are  created.  This  may  reduce  the  number  of  strongly 
connected  components  in  the  entire  data  structure.  Note  that  the  reduction 
in  the  number  of  strongly  connected  components  is  due  to  the  merger  of  two 
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or  more   previously  strongly  connected   components.      In   either   case,   edge 
addition  transformations  create  multigraphs  which  do  not  belong  in  the  same 
equivalence  class. 

The  relation  Q,  as  defined  in  Definition  6.1,  hence  is  not  an  equivalence 

ordering.  Node  replication  transformation  places  the  new  multigraph  in  the 
same  equivalence  class  and  satisfies  the  reflexivity,  symmetry,  and  transitivity 
properties.  However,  the  other  two  flowgraph  transformations  do  not.  This 
result  is  quite  the  same  as  in  the  case  of  transformations  applied  on  flow- 
graphs. 
6.2  MULTIGRAPH  TRANSFORMATIONS  AND  PARTIAL  ORDERS: 

The  relation  Q  is,  nevertheless,  a  partial  ordering.   The  results  obtained 

here,  again,  are  comparable  to  the  preservation  of  partial  order  by  structured 
flowgraph  transformations  applied  on  flowgraphs.  Assume  three  directed 
multigraphs  5K)  =  WJEJLJ,  M2  =  (t^xjj,  and  M,  =  {<Vy%JLj  where  multigraph  M, 
is  transformed  into  M,  using  any  of  the  directed  multigraph  transformations 
and  similarly  multigraph  04,  from  <M,. 

Clearly  Mt  Q  Mu  ttls  Q  %  and  M,  Q  M,  immaterial  of  the  apphcation  of 

node  addition,  node  rephcation,  or  edge  addition  transformations.  Therefore, 
reflexivity  property  holds. 

Node  replication  transformation  was  shown  to  retain  the  multigraphs  in 
the  same  equivalence  class,  i.e.,  M,  is  the  same  as  5W2  since  the  replicated  node 
will  be  removed.  Hence,  <M,  =  Mv  which  implies  that  node  replication  is 
antisymmetric.    With  any  of  the  other  two  multigraph  transformations,  unless 
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when  k.  =  0,  M,  Q  M,  but  Mz  <$  M,.   When  H  =  0,  M,  and  Mz  are  one  and  the 

same.    Therefore,  relation  Q  is  antisymmetric. 

If  M,  Q  %  and  M,  §  <H,  then  5W,  ^  M,  must  be  proved  to  show  the 
property  of  transitivity.  This  can  be  done  by  stating  that  if  M,  is  transformed 
into  Mt  and  2W,  into  M,  each  in  some  finite  sequence  of  multigraph  transforma- 
tions, then  it  is  possible  to  transform  iW,  into  3K)  in  some  finite  sequence  of 
transformations  and  hence  iM;  Q  My 

Since  Q  is  reflexive,  antisymmetric,  and  transitive  it  is  a  partial  order. 
In  other  words,  multigraph  transformations  preserve  the  partial  order  of  the 
relation  Q.    Summarizing  the  results  we  obtain  the  following  definition: 

(Definition  6-2:  The  relation  Q  on  a  set  of  directed  multigraphs  A  is  a  partial 

order  since  Q  is  reflexive,  antisymmetric,  and  transitive.   In  other 

words,  (Jl§)  is  a  poset. 

It  is  still  necessary  to  show  that  Tsai's  data  structure  complexity 
measure  preserves  the  ordering.  Although  Tsai's  polynomials  resemble 
characteristic  polynomials,  preservation  of  ordering  needs  to  be  proved  since 
the  method  with  which  Tsai's  polynomials  are  constructed  differs  from  that  for 
characteristic  polynomials. 
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6.3    PROOF    THAT    TSAI'S    POLYNOMIALS    PRESERVE    PARTIAL 
ORDER: 

•Definition  63:  Assume  a  set  of  directed  multigraphs  A  that  were  generated  using 
directed  multigraph  transformations.  Further,  let  C(A)be  the  set 
of  Tsai's  polynomials  computed  for  x  Then,  the  relation  Q  on  C{X) 

is  a  partial  ordering  and  (CMQ)  is  a  poset. 

Tsai's  data  structure  polynomials,   as  computed  from  directed  multi- 
graphs,  can  be  written  as  follows: 

cm  =  cm)     +  cm       +  ...  +  cm     +  s(a)  (i3) 

or 

C(X)   =  .1      Cfa)        +   S(A)  (14) 

where  C&D  refers  to  the  complexity  polynomial  of  the  vertex  that  corresponds 
to  the  entire  data  structure  A,  CWto  the  complexity  polynomial  of  the  strongly 
connected  component  A  of  the  data  structure,  and  s(A)  to  the  self-complexity 
monomial  of  the  data  structure.  Data  structure  A  is  assumed  to  contain  n 
strongly  connected  components.  Let  CMbe  the  Tsai's  polynomial  for  an  initial 
directed  multigraph  and  CM  that  of  the  transformed  multigraph  by  employing 
some  multigraph  transformations  on  the  initial  multigraph.  The  fact  that 
multigraph  transformations  preserve  the  partial  order  can  be  shown  by 
demonstrating  that  the  Tsai's  polynomial  of  the  initial  multigraph  is  contained 
in  the  Tsai's  polynomial  of  the  transformed  multigraph. 
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NODE  ADDITION  TRANSFORMATION: 

No  node  addition  could  be  made  from  an  atomic  node  and  is  always 
made  from  a  non-atomic  node.  Node  addition  transformation  always  creates 
a  new  strongly  connected  component  [n+l'th  component]  with  a  self-complexity 
monomial  and  complexity  polynomial  of  unity  in  addition  to  other  strongly 
connected  components  that  are  already  present.    That  is, 

CM  =  Z     COM       +   1  (is) 

i=l 

and  hence,  CJA)  £,  CM- 
NODE  REPLICATION  TRANSFORMATION: 

As  noted  earlier,  the  consequence  of  a  node  replication  transformation 
on  a  directed  multigraph  is  nil  since  the  auxiliary  definition  elimination  step 
of  the  Tsai's  polynomial  construction  process  eliminates  any  nodes  added 
through  the  node  replication  transformation. 

Hence,  the  Tsai's  data   structure  complexity  measures  of  the  initial 
directed  multigraph   and   the   transformed  multigraph   are   the   same   and 
consequently,  C.0V  ^.  C,(!\). 
EDGE  ADDITION  TRANSFORMATION: 

We  have  already  noted  that  edge  addition  transformations  can  not  be 
made  from  an  atomic  node  to  a  non-atomic  node  without  destroying  the 
features  of  the  programming  language.  As  before,  the  discussion  is  carried  on 
in  terms  of  forward  edge  addition  and  backward  edge  addition  transforma- 
tions. Furthermore,  unlike  structured  flowgraphs,  a  node  can  have  an  arrow 
pointing  back  to  itself.     Note  that  this  is  how  a  self-reference  of  a  data 
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structure  to  itself  is  depicted.    Such  circular  definitions,  where  a  node  has  an 
arc  pointing  back  to  itself-i.e.,  a  circular  path  of  length  one-are  grouped 
under  backward  edge  addition  transformations. 

FORWARD  EDGE  ADDITION  TRANSFORMATION: 

Case  1  --  Between  Two  Strongly  Connected  Components:     No  new  strongly 

connected  components  are  generated.    Since  the  edge  does  not  fall  within  a 

single  strongly  connected  component,  this  addition  has  no  impact  on  either  the 

self-complexity  monomial  or  on  the  complexity  polynomial.    Therefore,  C.W  < 

CM 

Case  2  --  Within  A  Strongly  Connected  Component:    Addition  of  one  edge 

raises  the  coefficient  of  the  self-complexity  monomial  of  the  strongly  connected 

component   by    unity.      The    polynomial    of  the    new    directed    multigraph, 

therefore,   is   larger  in   the   term   corresponding   to   the   strongly   connected 

component.    Hence,  C.(A)  £,  C,(A). 

BACKWARD  EDGE  ADDITION  TRANSFORMATION: 

Case  1  -  Between  More  Than  One  Strongly  Connected  Components:  These 

strongly    connected    components    merge    into    a    single    strongly    connected 

component.   The  self-complexity  monomial  of  the  merged  component  is  freshly 

calculated   and  is   not  the   sum   of  the   self-complexity  monomials   of  the 

individual  components  that  merged  into  one.    Let  nt  and  A2  be  two  strongly 

connected  components  with  self-complexity  monomials  S(AJ  and  S(XJ: 

A 
SM  =   (%  +  %  )    c  for  i=  1,  2.  (i6) 
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where  1^  is  the  number  of  nodes,  %,  the  number  of  edges,  and  £,  is  the  number 
of  simple  circular  paths  in  the  »'th  strongly  connected  component.  Assume 
that  these  two  components,  viz.,  H,  and  A2  are  linked  by  a  backward  edge 
addition  transformation.  Now  the  self-complexity  monomial  of  the  new 
strongly  connected  component,  call  it  S(A,  11  AJ,  becomes: 

Sfa    11  A,)    -  U  V,+  Vt)    +   (-£,+   55,  +   e)}   .cL'+L'+1  (17) 

where  %  %,  and  Lu  for  «=1,  2,  have  the  same  meanings  as  before  but  t  is  the 
number  of  forward  pointing  edges  from  A,  to  A,  or  from  A2  to  A,  which  were 
uncounted  because  A,  and  A,  were  two  separate  strongly  connected  components, 
and  c  is  raised  by  one  more  than  £,+£2  since  the  two  components  are  now 
linked  by  a  new  backward  edge. 

Comparing  expressions  (16)  and  (17),  the  self-complexity  monomials  of  the 
new  merged  component  has  a  higher  order  than  either  of  the  self-complexity 
monomials  of  A,  and  A,  individually  or  as  a  sum  of  them.  It  is  also  true  that 
the  coefficient  of  the  monomials  of  the  merged  component  is  larger  than  that 
of  either  of  S(AJ  and  S(AJ.  Therefore,  the  complexity  polynomial  of  the 
transformed  data  structure  is  larger  than  that  of  the  data  structure  before 
transformation. 

Note  that  a  backward  edge  addition  transformation  may  merge  more 
than  two  previously  strongly  connected  components.  However,  the  argument 
can  be  extended  to  cover  such  cases.  Tne  self-complexity  monomial  of  a 
merged  structure  when  more  than  two  previously  strongly  connected  com- 
ponents were  coalesced  together  can  be  written  as: 
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t  %     4  +    1) 
St    u    nj     =  I     2     %  +     1    %  +  i]  .    c  (is) 

j=i  M  i=l 

where  %  %,  and  U,  for  t=l,  2 n  have  the  same  meanings  as  before  but  e  is 

the  number  of  forward  pointing  edges  between  every  two  previously  strongly 
connected  components  A,  which  were  uncounted  because  -3,  were  separate 
strongly  connected  components.  The  variable  c  is  raised  by  one  more  than  IA 
since  all  these  components  are  now  linked  by  a  new  backward  edge. 
Case  2  --  Within  A  Strongly  Connected  Component:  The  self-complexity 
monomial  of  the  strongly  connected  component  is  raised  in  two  ways:  i)  the 
order  of  the  monomial  goes  up  by  one  because  the  backward  edge  indicates  a 
self-reference  to  itself  or  reference  to  a  parent  node,  and  ii)  the  coefficient 
increases  because  there  is  one  more  edge  now  than  before  the  transformation. 
Therefore,  CM  <..  CM 

Case  3  -  Backward  Edge  Addition  From  A  Node  To  Itself:    The  effect  on  the 
self-complexity  monomial  in  this  case  is  identical  to  Case  2. 

Hence,  we  have  shown  that  Tsai's  polynomial  preserves  the  order. 
6.4  SYNOPSIS: 

We  noted  that  if  a  directed  multigraph  M2  can  be  obtained  from  the 
directed  multigraph  9W,  using  directed  multigraph  transformations,  then  the  set 
of  directed  multdgraphs  and  the  relation  Q  [see  the  start  of  Chapter  6  for  a 
definition  of  §\  is  a  partial  order. 
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Tsai's  polynomials  were  then  constructed  for  directed  multigraphs  and 
were  shown  to  preserve  the  order.  These  polynomials,  like  characteristic  poly- 
nomials, do  not  lend  themselves  to  direct  comparison.  The  mapping  measure, 
MM,  discussed  in  Chapter  4  can  be  utilized  to  map  Tsai's  polynomials  to  real 
numbers. 

In  order  to  validate  characteristic  polynomials  utilizing  the  software 
complexity  measure  validation  paradigm  given  in  Appendix  B,  let  m  be  a 
[hypothetical]  data  structure  complexity  measure  which  is  strictly  monotone 
with  respect  to  the  Q  partial  order.    Then  if  a  and  b  are  data  structures  with 

associated  directed  multigraphs  SW,  and  Mu  respectively,  and  if  m(a)  <  m(6),  then 
we  cannot  have  that  MM&&J  >  9/fMgHQ,  That  is,  Tsai's  polynomials,  as  a 
complexity  measure,  is  strictly  monotone  with  respect  to  the  Q  preordering  of 
all  directed  multigraphs. 


Chapter  7 

AN  EXTENDED  COMPLEXITY 
MEASURE? 


7.1  COMBINING  SEVERAL  DATA  ITEMS  WITHIN  A  SINGLE 
SUBPROGRAM: 

Tsai's  method  yields  as  many  polynomials  as  there  are  disparate  data 
items  within  a  subprogram.  It  is  still  possible  to  compare  the  complexities  of 
different  data  structures  by  utilizing  lexicographical  ordering  on  the  space  of 
polynomials  in  one  variable.  Consider  the  following  situation.  There  are  two 
programs  that  perform  the  same  function  essentially.  Their  source  code  and 
data  structure  differ  considerably.  Further,  assume  that  the  source  code 
complexity  has  somehow  been  computed  for  each  program  and  have  been 
mapped  to  some  real  numbers.  However,  each  program  has  very  dissimilar 
data  structures,  each  with  several  segregated  data  items.  Computation  of 
Tsai's  polynomials,  therefore,  results  in  two  sets  of  data  structure  complexity 
polynomials  for  each  program  making  it  impossible  to  perform  any  comparison 
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between  the  two  programs'  data  structure  complexities.  Note  that  each  set  of 
data  structure  complexity  polynomials  may  contain  differing  number  of 
elements.  It  might  be  useful,  therefore,  if  we  could  come  up  with  a  measure 
that  encompasses  the  data  structure  complexity  of  a  given  program  no  matter 
how  many  distinct  data  definitions  the  program  possesses. 

Assume  two  programs  whose  data  structure  complexity  we  want  to 
compare.     Consider  further  that  the  following  [Pascal-like]  data  definitions 
make  up  their  data  structures: 
Data  Structure  1  Data  Structure  2 

var  var 

a  :  RECORD 
x  :  integer,  x  :  integer; 

y  :  real;  y  :  real; 

z  :  array  [1..10]  of  char;  z  :  array  [1..10]  of  char; 

END; 

Tsai's  measure: 

(1,  1,  3)  (6) 

The  data  structures  of  both  the  programs  contain  similar  data  items 
except  for  the  obvious  difference  that  the  second  program  uses  the  RECORD 
concept  to  integrate  the  data  items.  The  fact  remains  that  the  complexity  of 
individual  components  of  the  RECORD  data  structure  in  the  second  program 
are  the  same  as  that  of  the  data  items  in  the  first  program.  The  grouping 
results  in  the  summation  of  the  individual  complexities  plus  a  little  more. 
That  little  more  was  added  to  take  into  account  the  incorporation  of  the 
RECORD  data  type. 

It  might,  therefore,  be  less  objectionable  to  say  that  the  data  structure 
complexity  of  the  first  program  can  be  thought  of  as  the  sum  of  the  com- 
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plexities  of  the  various  data  items,  viz.,  1+1+3  =  5  rather  than  the  vector  (1, 
1,  3}.  If  this  argument  is  agreeable,  then  it  is  possible  to  say  that  the  data 
structure  of  the  second  program  is  more  complex  than  that  of  the  first. 
Otherwise,  no  direct  comparison  of  the  two  Tsai's  polynomials  can  be  made! 
Unless  there  are  cycles  involved  in  the  data  structure,  this  appears  reason- 
able! 

7.2  COMBINING  CHARACTERISTIC  POLYNOMIALS  AND  DATA 
STRUCTURE  COMPLEXITY: 

Combining  a  source  code  complexity  and  a  data  structure  complexity 
and  arriving  at  a  hybrid  measure  that  measures  the  complexity  of  a  software 
is  an  ultimate  goal.  In  that  sense,  the  interest  in  this  paper  is  to  see  if  Tsai's 
data  structure  complexity  measure  and  characteristic  polynomial  could 
somehow  be  collapsed  into  a  single  measure.  They  were  both  shown  to  be 
preserving  the  partial  orders  and  in  their  final  form  they  were  both  poly- 
nomials. It  appears  that  combining  them  is  a  possibility.  However,  several 
problems  have  to  be  tackled  before  that: 

••  Characteristic  polynomials  were  computed  for  only  one  program. 
Issues  relating  to  computing  characteristic  polynomials  for  real- 
world  software  programs  with  several  subroutines  within  a  single 
module  and  several  modules  within  a  single  program  need  to  be 
resolved. 
••  The  issues  with  respect  to  obtaining  a  Tsai's  data  structure 
complexity  measure  for  a  software  system  are  more  multifarious 
since  i)  disparate  data  structures  within  a  single  subroutine  yield 
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as  many  Tsai's  polynomials  as  there  are  disparate  data  structures 
and  hence  issues  relating  to  computing  Tsai's  polynomial  for  a 
subroutine  must  be  resolved;  ii)  Issues  concerning  computing  a 
data  structure  complexity  measure  for  the  software  with  several 
modules  and  subroutines  with  their  own  data  structures  will  then 
have  to  be  resolved. 

•  •       There   is    then   the   fundamental    question    of  the    validity   of 

combining  a  source  code  complexity  which  is  based  on  digraphs 
and  a  data  structure  complexity  which  is  based  on  multigraphs. 

•  •       We  have  separately  validated  both  complexity  measures  as  shown 

in  the  earlier  sections.    We  have  to  come  up  with  a  measure  for 
validating  any  such  extended  complexity  measure. 
7.3  CONCLUSIONS: 

Two  different  complexity  measures  were  detailed  in  this  paper.  One  of 
the  measures,  named,  characteristic  polynomials,  measures  the  complexity 
associated  with  source  code  and  the  other,  Tsai's  polynomial  tries  to  measure 
data  structure  complexity.  Existence  of  equivalence  classes  and  partial 
orderings  with  respect  to  both  measures  were  investigated.  Such  investigation 
was  done  in  terms  of  structured  flowgraph  transformations  in  case  of 
characteristic  polynomials  and  in  terms  of  directed  multigraph  transformations 
in  case  of  Tsai's  polynomials. 

Four  different  mapping  measures  that  map  the  complexity  measures  to 
real  numbers  were  tested  and  one  of  them  was  shown  to  preserve  the  order. 
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The  complexity  measures  were  then  validated  using  a  new  complexity  measure 
validation  paradigm. 

Future  work  will  focus  on  devising  ways  to  combine  source  code  and 
data  structure  complexity  measures.  However,  as  pointed  out  earlier  in  this 
chapter,  several  related  issues  need  to  solved  in  order  to  achieve  a  hybrid  or 
extended  measure  of  complexity. 


Appendix  A. 


GRAPHS,  RELATIONS, 

ORDERS,  FLOWGRAPHS,  AND 

FLOWGRAPH  TRANSFORMATIONS 


GRAPHS,  RELATIONS,  AND  ORDERINGS: 

More  details  on  these  and  other  concepts  can  be  found  in  Mott,  Kandel, 
and  Baker8  and  in  Kolman  and  Busby8  as  well  in  countless  other  discrete 
mathematics  texts. 


Moct,  Joe  L.,  Abraham  Kandel,  and  Theodore  P.  Baker,  Discrete  Mathematics  for  Computer 
Scientists,    Reston  Publishing  Co.,  Reston,  VA,  1983. 

"Kolman,  Bernard  and  Robert  C.  Busby,  Discrete   Mathematical    Structures    for   Computer 
Science,    2nd  Edition,  Prentice-Hall,  Inc.,  Englewood  Cliffs,  NJ,  1987. 
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A.1.  DIRECTED  GRAPHS: 

Directed  graphs,  also  called  digraphs,  are  a  way  of  viewing  relations  on 
sets.  A  pair  of  sets  Q  =  {Vfe),  where  the  elements  of  V  are  called  vertices  and 
the  elements  of  £  are  called  edges,  is  a  digraph  if  E  is  a  subset  of  <V  x  V  An 
edge  (a,b)  is  said  to  be  from  vertex  a  to  vertex  b  and  is  represented  in  a 
diagram  by  an  arrow  emanating  from  vertex  a  and  pointing  at  vertex  b.  Such 
an  edge  is  said  to  be  incident  from  a,  incident  to  b,  and  incident  on  both 
o  and  b.  [Edges  have  no  directions  in  case  of  non-directed  graphs  and  an  edge  in 
non-directed  graphs  is  represented  /a,bj.]  The  number  of  edges  incident  from 
a  vertex  is  called  the  out-degree  of  the  vertex  and  the  number  of  edges  incident 
to  a  vertex  is  called  the  in-degree. 

An  edge  from  a  vertex  to  itself  is  called  a  loop.  A  path  in  Q  is  a  sequence 
of  zero  or  more  edges  e„  ...,  e„  in  •£  such  that  «,  =  (viuvj  for  each  Hi$n.  A 
path  is  simple  if  all  edges  and  vertices  on  the  path  are  distinct,  except  that  va 
and  p„  where  v,  is  the  vertex  at  which  the  path  originates  and  v,  is  the  vertex 
where  the  path  terminates,  may  be  equal.  A  path  of  length  >  1  from  a  node 
to  itself  [where  the  end  nodes  are  the  same]  is  a  cycle  or  circuit.  A  cycle 
traversing  distinct  nodes,  except  for  the  end  nodes,  is  said  to  be  a  simple  cycle. 
A  digraph  is  acyclic  if  it  contains  no  cycle  and  it  is  cyclic  if  it  contains  at  least 
one  cycle. 

Notice  also  that  for  any  digraph  (VSi  £  is  a  binary  relation  [see  below 

for  a  brief  discussion  on  relations]  on  V.    Similarly,  any  binary  relation  %^ 
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which  is  a  subset  of^xS may  be  viewed  as  a  digraph  g  =  (A  U  %'R}.    In  this 

sense,  the  notion  of  binary  relation  on  a  set  and  the  notion  of  digraph  are 

equivalent. 

A.2.  RELATIONS: 

A  relation  %.from  a  non-empty  set  A  to  another  non-empty  set  S  is  a 

subset  of  A  k  V-  If  M  e  %.  where  aeA  and  6  e  %  then  a  is  said  to  be  related  to 
b  by  %    It  is  also  common  to  utilize  the  form  a  Hi  6  to  denote  the  relation. 
If  !£is  a  subset  of^xA  then  ^.is  expressed  as  a  relation  on  A  [i.e.,  a  H^a], 
instead  of  writing  it  as  a  relation  from  A  to  X 
A.2.1.  SOME  PROPERTIES  OF  RELATIONS: 

Binary  relations,  as  defined  above,  exhibit  the  following  special  proper- 
ties among  others:  refleiovity,  irrejle.7Q.vity,  symmetry,  asymmetry,  antisymmetry,  and  transitivity. 
[Please  refer  to  the  references  listed  in  footnotes  10  and  11  for  additional 
details.] 

PROPERTY  OF  REFLEXTvTTY: 

A  relation  ^.ona  set  A  is  rtfla&e  if  (a,a)  e  ^.for  every  a  e  X  That  is,  for 
all  a  e  x  a  H(_  a. 

PROPERTY  OF  IRREFLEXTVITY: 

A  relation  i£.on  a  set  A  is  irreflexive  if  (a,a)  ft  ^.for  every  aeX  That  is,  for 
all  ae  X  affta. 
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PROPERTY  OF  SYMMETRY: 

A  relation  !^.is  symmetric  if  whatever  (a,b)  z  %,  then  (b,a)  Z  Hd   That  is,  for  all 

az  A  and  £  e  A  if  a  ^.  £  then  b%.a. 

PROPERTY  OF  ASYMMETRY: 

A  relation  ^.ona  set  A  is  asymmetric  if  (a,b)  z  !£,  then  (6,a)  t  %    That  is, 

for  all  a  z  A  and  bz  A,i(  a%_b  then  £  #l *• 
PROPERTY  OF  ANTISYMMETRY: 

A  relation  ^,ona  set  A  is  antisymmetric  if  whenever  (a,b)  e  ^.  and  ("W  e  %_ 

then  a  =  £.    That  is,  for  all  a  z  ^  and  £  e  .3,  if  a  %,  b  and  £  %.  a  then  a  =  b. 

Utilizing  the  contra-positive  of  the  above  implication,  it  can  be  reformulated 
as,  if  a  /  b,  then  either  a  $(, 6  or  b%.a. 

PROPERTY  OF  TRANSITP/ITY: 

A  relation  is  transitive  if  whenever  (a,b)z  ^.and  (b,c)z  %.  then  (a,c)z  H^   That 

is,  if  a  "Sib  and  £!£.<:  then  a^c. 

A.2.2.  EQUIVALENCE  RELATIONS: 

Equivalence  relations  are  the  primary  tools  employed  in  the  process  of 
abstraction,  or  selectively  ignoring  differences  which  are  irrelevant  to  the 
purpose  at  hand.  Within  a  given  context,  we  say  that  two  things  are 
equivalent  if  the  differences  between  them  do  not  matter. 

In  terms  of  formal  mathematics,  a  binary  relation  is  an  equivalence 
relation  if  it  is   reflexive,   symmetric,   and  transitive.      Given  a   set  A,  an 
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equivalence  relation  %.  on  A  is  defined  to  be  a  binary  relation  on  A  that 

satisfies  the  properties  of  reflexivity,  symmetry,  and  transitivity. 

This  property  of  equivalence  expresses  important  aspects  of  being  the 
same  which  are  ordinarily  taken  for  granted  and  are  usually  obvious  for 
specific  equivalence  relations.  Another  way  of  looking  at  equivalence  relations 
is  as  a  means  of  dividing  things  into  classes.  Any  time  a  set  is  partitioned  [see 
the  next  section  for  a  definition  of  partition]  into  disjoint  subsets  an  equiv- 
alence relation  is  involved.  The  notions  of  equivalence  relations  and  equiv- 
alence classes  are  interchangeable. 
A.2.2.1.  Partitions: 

Given  a  set  A  a  partition  of  j?  is  a  collection  T  of  disjoint  subsets  whose 
union  is  A  That  is, 

1.  for  any  A,  e  %  A,  is  a  subset  of  A: 

2.  for  any  A,  e  <B  and  A,  e  %  A,  n  A,  =  $,  or  &,  =  Aj  and 

3.  for  any  a  e  A,  there  exists  A,  e  <£  such  that  a  e  A,. 
A.2.2.2.  Equivalence  Classes: 

Given  any  set  A  and  any  equivalence  relation  ^,onjl  the  equivalence  class 
la]  of  each  element  a  of  A  is  defined  la]  =  {bzA  \  a%.  i).  Note  that  we  can  have 
la]  =  lb],  even  if  a  J  b,  provided  aH^b.  That  is,  this  notation  does  not  give  a 
unique  name  to  each  equivalence  class. 

Digraphs  can  be  treated  as  a  special  case  of  a  more  general  kind  of 
graph,  called  directed  multigraph,  just  as  binary  relations  are  a  special  case  of 
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n-ary  relations.     [Directed  multigraphs  are  discussed  in  a  following  subsec- 
tion.]    However,  note  that  multigraphs  are  not  all  relations,  just  as  n-ary 
relations  are  not  graphs  except  when  n  =  2. 
A.3.  ORDERINGS: 
A.3.1.  PREORDER: 

Given  a  set  A,  a  relation  ^,is  said  to  be  a  unordering  if  set  A  satisfies  the 

properties  of  reflexivity  and  transitivity. 
A.3.2.  PARTIAL  ORDER: 

Given  a  set  A  a  partial  order  %_  on  the  set  A  is  defined  to  be  a  binary 

relation  on  A  that  satisfies  the  properties  of  reflexivity,  antisymmetry,  and 
transitivity. 

The  word  partial  in  partial  ordering  comes  from  the  fact  that  the  axioms 
of  reflexivity,  antisymmetry,  and  transitivity  do  not  guarantee  that  for  every 
pair  (a,6)  £  A  x  A  at  least  the  relations  a  %,  6  or  6  5L  a  must  hold.  That  is, 
partial  order  does  not  guarantee  that  every  pair  of  elements  is  always 
comparable.    The  set  A  together  with  the  partial  order  3?.  is  called  a  partially 

ordered  set,  or  poset  and  is  denoted  by  (A,%}. 
A.3.3.  TOTAL  ORDER: 

If  set  A  is  a  partially  ordered  set  satisfying  relation  3f.and  if  for  every 
a  and  6  in  A,  either  a  <  6  or  6  <,  a,  then  ^.is  called  a  total  ordering.  It  is  also 
called  linear  ordering. 
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A.4.  MONOTONICITY: 

Let  a&  A  and  6  e  H  and  H  be  a  preordered  set.    Further,  assume  that  / 
is  a  function  from  the  preordered  set  &  to  a  partially  ordered  set  3.    The 
function  /  is  monotone  if  whenever  a  <  £  then  /fo)  S  /JBJI   /  is  said  to  be  stricthj 
monotone  if  whenever  a  <,  b  and  a  /  &  then  /fo)  <  /jSjl 
A.5.  DIRECTED  MULTIGRAPHS: 

A  multigraph  is  a  triple  (V,iSi,  consisting  of  a  set  of  vertices  V,  a  set  of 
edges  %  and  an  incidence  junction  f  that  maps  each  edge  in  £  to  a  pair  of  vertices 
in  V.  For  a  dincted  mtdtigrapA,  the  incidence  function  maps  edges  to  ordered  pairs 
of  vertices.  In  this  case  an  edge  e  is  said  to  be  from  vertex  a  to  vertex  6  if 
and  only  if  t(e)  =  (a,6).  [For  non-directed  midtigrapns  the  incidence  function  maps 
edges  to  unordered  pairs  of  vertices.  In  this  case,  an  edge  e  is  said  to  be 
between  vertex  a  and  vertex  6  if  and  only  if  f(e)  =  (a,6j.] 

Note  that  directed  multigraphs  [as  well  non-directed  multigraphs]  are 
multigraphs  because  they  may  contain  more  than  one  distinct  edge  between 
two  vertices.  This  does  not  satisfy  the  regular  definition  of  graph  since  graphs 
allow  only  one  distinct  edge  between  any  two  vertices.  This  implies  that  edges 
of  directed  (non-directed)  multigraphs  can  not  be  expressed  as  a  set  of  ordered 
(unordered)  pairs  of  vertices. 
A.6.  MULTIGRAPH  TRANSFORMATIONS: 

Most  of  the  material  reviewed  in  the  appendixes  can  be  found  in  the 
literature  elsewhere.     This  section,  however,  was  developed  by  the  author. 
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The  insight  for  the  material  presented  here  comes  from  Baker10,  et.  al.  The 
need  for  this  material  will  be  seen  in  the  example  given  as  part  of  Tsai's 
polynomials  where  multigraph  representing  data  structures  are  transformed 
for  the  purpose  of  computing  Tsai's  polynomials.  These  three  transformations 
enable  adding  to  or  enhancing  a  multigraph. 

For  the  following  transformations  assume  a  directed  multigraph  Of  = 
(V,%f)  where  V  denotes  the  set  of  vertices,  %  the  set  of  edges,     and  f,  an 
incidence  function  that  maps  each  edge  in  £  to  a  pair  of  vertices  in  1/,  i.e.,  an 
edge  c  is  said  to  be  from  vertex  a  to  vertex  6  if  and  only  if  {(e)  =  (a,S). 
A.6.1.  NODE  ADDITION  TRANSFORMATION: 

Given  a  directed  multigraph  at  •  CU,%{),  a  new  directed  multigraph  at'  = 
(<Vu  vfr  11  «,f;can  be  obtained  from  at  using  a  node  addition  transformation  if 

1.         v  #V, 


Baker,  et.  al.,  op_.  clt..  where  they  describe  three  transformations  that  can  be 
performed  on  flowgraphs,  viz.,  nod*  rtptvatwn,  tdgi  addition,  and  pain  addition  trawformatums.  Assume  a 
flowgraph  g  -  (H  Tj,t),  where  ^denotes  the  set  of  nodes,  I,  the  set  of  edges,  s,  the  start  node,  and  i,  the 
terminal  node.    Then  the  following  three  transformations  can  be  defined  on  a  flowgraph. 

NODE  REPLICATION  TRANSFORMATION: 

Given  a  flowgraph  g-(f(/LMJ,a  new  flowgraph  g' .  CHU i!L U,t) can  be  obtained  from  fusing  a  node 
replication  transformation  if 

1.  H% 

2.  there  is  one  edge  (a,b)  e  I'  where  a  e  % 

3.  I'  includes  one  edge  (6,d  for  each  edge  of  the  form  (a,c)  in  I, 

4.  I'  does  not  include  any  of  the  edges  in  £  of  the  form  (*4  and 

5.  except  for  the  changes  indicated  in  items  3.  and  4.  above,  x'is  the  same  as  £. 

EDGE  ADDITION  TRANSFORMATION: 

Given   a   flowgraph   g  .  Pitj.t),  then    §'  -  (%<E  V  v,t)  is   obtained  from   g  by  an   edge   addition 
transformation  if  t  is  an  edge  from  node  a  to  node  b,  where  ae^'ie^tg'x. 

PATH  ADDITION  TRANSFORMATION: 

Given  a  flowgraph  g  -  PitMh  then  f-ftliiio  Mt«M  is  obtained  by  path  addition 
transformation  ifetf3£*ae:^*£e% 
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2.  t  &£,  i.e.,  there  does  not  exist  in  :W  either  f(e)  =  fcyjor  t(e)  =  (yd  for 
any  %,  y  e  V,  and 

3.  For  every  t(a)  =  fay)  for  any  *,  yz  V,  where  a  e  £  there  exists  f  feZ 
=  (icy).  In  addition,  there  exists  in  94'  either  f  (t)  =  fc£j  or  F$  =  (6,k) 
for  some  nee  94. 

A.6.2.  NODE  REPLICATION  TRANSFORMATION: 

Given  a  directed  multigraph  94  =  (VjEfi  a  new  directed  multigraph  94'  = 
(VIA  6,X',F jean  be  obtained  from  fusing  a  node    replication  transformation  if 

1.  v#V, 

2.  there  exists  one  edge  el,  t(U)  =  (a,v),  belongs  to  £'  where  «eV, 

3.  £' includes  one  edge  tZ,  t(a)  =  (WJ  for  each  edge  i  of  the  form  t{t) 
=  (a,6)  in  £, 

4.  £'  does  not  include  any  of  the  edges  in  X,  of  the  form  t(e)  =  (a,6), 
and 

5.  except  for  the  changes  indicated  in  items  3.  and  4.  above,  £'is 
the  same  as  t. 

A.6.3.  EDGE  ADDITION  TRANSFORMATION: 

Given  a  directed  multigraph  94  =  (V,%f),  a  new  directed  multigraph  94'  = 
(V,T.  U  e,f ;  can  be  obtained  from  94  using  an  edge  addition  transformation  if 

1.  the  incidence  function  f  is  such  that  for  every  f(e)  =  (a,6)  there 
exists  F(t)  =  (a,6)  for  all  a,6  e  V, 

2.  however,  f  (e)  =  fa,£/  does  not  mean  that  there  exists  f(e)  =  (a,6),  and 
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3.        the  number  of  edges  of  the  form  (a,B)  in  !M"is  one  greater  than  the 
number  of  edges  of  the  same  form  in  M. 
A.7.  FLOWGRAPHS: 

A  flowgraph  is  basically  a  digraph.  However,  to  be  treated  as  flow- 
graphs,  additional  restrictions  are  imposed  on  digraphs.  Notations  used  also 
differ  somewhat.  Vertices  of  digraphs  correspond  to  nodes  in  flowgraphs  and 
edges  of  digraphs  are  sometimes  also  known  by  the  term  ores. 

1.  a  single  vertex  s  e  V  is  considered  the  start  node  through  which 
entry  into  the  flowgraph  occurs; 

2.  a  single  vertex  t  e  1/  is  considered  the  terminal  node  through  which 
an  exit  is  made  out  of  the  flowgraph; 

3.  there  exist  path(s)  from  the  start  node  to  all  the  other  nodes  in 
the  flowgraph; 

4.  there  exist  path(s)  from  any  node  to  the  terminal  node; 

5.  except  for  the  above  four  restrictions,  the  flowgraph  is  essentially 
a  digraph. 

A  flowgraph,  therefore,  can  be  denned  as  Q  =  U(l%s,t),  where  2\£is  a  set 
of  nodes,  t  is  a  set  of  edges,  and  s  and  t  are  start  and  end  nodes  respectively. 
A.8.  STRUCTURED  FLOWGRAPHS: 

The  foregoing  flowgraph  definition  [and  the  three  flowgraph  transforma- 
tions as  discussed  in  footnote  12]  will  aid  in  drawing  a  flowgraph  for  any 
program  including  non-structured  ones.  One  structured  flowgraph  [see  below 
for  a  definition  of  structured  flowgraph]  can  be  transformed  into  a  new,  more 
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complex  structured  flowgraph  by  utilizing  a  set  of  node  replication,  edge 
addition,  and/or  path  addition  flowgraph  transformations.  While  the  original 
flowgraph  and  the  transformed  flowgraph-after  several  likely  transformations- 
-may  be  structured,  the  intermediate  flowgraphs  might  not  have  retained  the 
structuredness  property.  Since  characteristic  polynomials  are  computed  only 
for  structured  flowgraphs,  there  are  several  intermediate  transformations  of 
flowgraphs  for  which  no  characteristic  polynomial  can  be  computed.  Discus- 
sion, therefore,  is  made  in  terms  of  structured  flowgraphs  and  structured 
flowgraph  transformations.  This  subsection  describes  structured  flowgraphs 
and  transformations  that  retain  the  structuredness  of  the  flowgraphs  intact. 

A  structured  flowgraph,  first  of  all,  obeys  the  restrictions  imposed  on 
flowgraphs  [see  Section  A.8].  It  can  be  easily  shown  that  those  restrictions 
are  not  sufficient  to  guarantee  structured  flowgraphs.  We  will  use  the 
following  well-known  ideas  in  defining  structured  flowgraphs.  [The  use  of 
these  ideas  results  from  the  formal  demonstration  by  Bbhm  and  Jacopini" 
that  any  well-behaved  computer  program,  i.e.,  one  without  an  infinite  loop 
could  be  constructed  using  only  the  if/tfun/eCsc  and  uifiik/do  control  statements 
and  the  appropriate  sequential  operations.] 

A  structured  flowgraph  is,  as  the  name  suggests,  a  flowgraph  and  hence 
is  a  digraph.  However,  the  nodes  can  be  differentiated  depending  on  the 
basis  of  their  location  and  the  degrees.  [The  differences  among  the  nodes  is 
often  shown  by  using  different  shapes  and  it  is  a  common  knowledge  that 


BOhm,  Corrado  and  Giuseppe  Jacopini,  "Flow  Diagrams,  Turing  Machines,  and  Languages 
with  Only  Two  Formation  Rules",  Communications  of  the  ACM,    vol.  9,  no.  5,  May  1?66,  pp.  366- 
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such  diagrams  are  called  flowcharts.]  The  nodes  are  classified  into  actum  nodes, 
condition  nodes,  and  junction  nodes12.  Action  nodes  have  an  in-degree  of  1  and  an 
out-degree  of  1.  Condition  nodes  have  an  in-degree  of  1  and  an  out-degree  of 
more  than  1.  Condition  nodes  having  an  out-degree  of  2  is  also  called  a 
predicate  node.  Junction  nodes  possess  an  in-degree  greater  than  1  and  an  out- 
degree  of  1.  There  are  two  exceptions  to  junction  nodes:  the  entry  node  has  an 
in-degree  of  0  and  an  out-degree  of  1  and  the  exit  node  has  in-degree  greater 
than  0  but  out-degree  equal  to  zero.  As  we  have  noted  earlier,  a  flowgraph 
must  have  an  entry  node  and  an  exit  node. 

Using  the  conventional  pictorial  representations,  action  nodes  are  drawn 
as  rectangles,  condition  nodes  as  diamonds,  and  junction  nodes  as  circles. 
Nevertheless,  flowgraphs  are  still  digraphs. 

The  simplest  nontrivial  flowgraph  [Figure  A.  1(a)]  consists  of  a  single 
action  node  having  one  predecessor  node  [the  entry  node]  and  one  successor 
node  [the  exit  node].  Let  this  flowgraph  be  called  basic  flowgraph.  Given  this 
basic  flowgraph,  more  complex  flowgraphs  can  be  constructed  by  applying 
what  will  be  called  structured  flowgraph  transformations. 
'Definition  of  Structured  flowgraphs: 

1.  a  structured  flowgraph  satisfies  the  restrictions  of  flowgraphs  as 
indicated  in  Section  A.8., 

2.  the  basic  flowgraph  is  a  structured  flowgraph,  and 


For  more  on  this  topic  see  Art  Lew,  Computer  Science:  A  Mathematical  Introduction, 
Prentice-Hall,  Englewood  Cliffs,  1985. 
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3.         a  structured  flowgraph  on  which  any  or  all  of  the  following  three 
transformations  were  applied  is  still  a  structured  flowgraph. 
A.8.1.  STRUCTURED  FLOWGRAPH  TRANSFORMATIONS: 

The  basic  structured  flowgraph  [Figure  A  1(a)]  consists  of  a  single  action 
node  along  with  an  entry  node  and  an  exit  node.  More  complex  structured 
flowgraphs  can  be  constructed  by  combining  simpler  flowgraphs  in  one  of  the 
following  three  ways.  Such  methods  are  labeled  structured  flowgraph.  transformations 
in  this  paper. 
A.8.1. 1.  Composition: 

Flowgraphs  are  combined  sequentially  by  letting  the  exit  node  of  one 
structured  flowgraph  coincide  with  the  entry  node  of  another,  resulting  in  a 
composite  structure.     This  is  the  simplest  of  the  three  transformations  and  is 
exhibited  in  Figure  A.  1(b). 
A.8.1.2.  Alternation: 

This  transformation  results  in  flowgraphs  that  depict  if-then-else  and 
case  structures.  In  this  case,  flowgraphs  are  combined  in  parallel  by  letting 
two  or  more  structured  flowgraphs  be  successors  of  a  common  predicate  node 
and  also  letting  them  have  a  common  exit  node,  resulting  in  an  alternation 
structure.  Figure  A.  1(c)  shows  the  resulting  structured  flowgraph  for  an  if- 
then-else  structure.  A  flowgraph  for  a  case  structure  can  similarly  be  drawn 
easily. 
A.8.1.3.  Iteration: 

Flowgraphs  can  be  transformed  by  placing  flowgraphs  within  a  loop 
with  a  predicate  node   at  its  beginning  or  end  to  govern  its  termination, 
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resulting  in  an  iterative  structure.   Depending  on  the  location  of  the  predicate 
node,  the  resulting  structured  flowgraph  represents  the  familiar  while-do  or 
repeat-until  structure  [Figure  A.  1(d)  and  Figure  A.l(e)\. 

Despite  the  fact  that  we  have  discussed  structured  flowgraph  transfor- 
mations in  terms  of  composition,  alternation,  and  iteration,  they  are  still  in 
essence  node  addition,  edge  addition,  and  path  addition  flowgraph  transforma- 
tions [see  Footnote  12].  However,  these  higher  level  transformations  prevent 
the  creation  of  a  non-structured  flowgraph  from  a  structured  flowgraph. 


Appendix  B. 


A  VALIDATION  PARADIGM 


B.l.  A  SOFTWARE  COMPLEXITY  MEASURE  VALIDATION  PARADIGM: 

The  contents  of  this  section  is  ascribed  to  the  work  by  Baker13,  et.  al. 
Let  A  be  a  set  [of,  perhaps,  programs]  and  let  ii  be  the  set  of  real  numbers. 
Now  one  can  assume  defining  a  software  complexity  measure  m  which  gauges 
some  component  of  complexity  of  the  elements  in  A  by  mapping  A  into  Q. 

A  typical  validation  paradigm  for  software  complexity  measures  research  would 
proceed  to  answer  the  question:  Does  the  complexity  measure  m  accurately 
quantify  the  intended  component  of  software  complexity?  The  paradigm  would 
accomplish  that  by  trying  to  obtain  some  ranking  of  A  which  is  obtained 
independently  from  m  but  which  can  still  be  used  to  establish  the  worth  of  m. 
Note  that  this  paradigm  assumes  the  existence  of  a  ranking  by  complexity  of 
A.  However,  since  it  is  impossible  to  rank  all  the  elements  of  A,  the  ranking 
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of  A  is  usually  approximated  by  first  choosing  a  subset  .3 'of  A  and  then  by 
selecting  and  using  some  process  data  [pd\  for  each  element  of  the  subset  A'. 
Collection  of  the  process  data  might  be  done  in  several  possible  ways:  by 
using  an  experimental  population  to  directly  obtain  a  ranking  of  the  elements 
of  .a' either  by  subjective  rankings  or  by  other  experiments;  by  compiling  data 
such  as  fines  of  code,  size  of  the  compiled  code,  average  length  of  execution 
time,  person-hours  used  to  produce  each  element  of  A',  number  of  reported/cor- 
rected errors  or  changes,  etc. 

The  process  data  pd(aj,  for  all  a,  e  J?' is  selected  with  the  intention  [or 
assumption]  that  if  pd(aj  <  pdltij)  ihen  pranks  the  set  of  elements  .a 'and  hence 
this  ranking  is  significant.  The  values  for  pd(aj  for  all  a,  e  ;?' facilitates  the 
examination  of  the  quantitative  relationship  between  the  set  of  values  for  ptC 
and  m  on  the  elements  of  A'.  If  the  way  that  pranks  .3 'is  sufficiently  similar 
to  the  way  that  m  ranks  A',  then  mis  concluded  to  rank  the  elements  of  A. 

Baker,  et.  al.,  note  three  problems  with  the  above  paradigm  for 
validation  of  software  complexity  measures: 

1.  it  is  unknown  that  the  presumed  complexity  ranking  of  the 
domain  set  A  exists; 

2.  it  is  unknown  that  the  subset  A' oi  A  is,  in  some  rigorous  sense, 
representative  of  the  domain  set  A:  and 

3.  it  is  unknown  that  the  selected  process  data  pdis  an  adequate 
approximation  of  the  elusive  complexity  ranking. 
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The  ranking  of  the  elements  of  A  that  is  desired  to  validate  the 
complexity  measure  is  referred  to  in  discrete  mathematics  as  orders  on  the 
domain  set  A.  The  prime  goal  of  software  complexity  measures  research  is  to 
find  functions  [measures]  which  preserve  these  orders.  Validating  complexity 
measures  can  be  thought  of  as  a  secondary  goal.  Specifically,  validating 
characteristic  polynomials  and  Tsai's  polynomials  is  one  of  the  missions  of  this 
paper. 

An  assumption  in  the  software  complexity  measures  paradigm  is  that 
A  is  at  least  a  preordered  set  and  that  m  is  a  monotone  function  from  A  to  Q 
which  is,  of  course,  linearly  ordered.  A  can  not  be  assumed  to  be  linearly 
ordered  since  A  is  not  known  to  be  a  partially  ordered  set.  Note,  however, 
that  it  is  difficult  to  produce  preorders  for  each  possible  set  A  since  it  is  an 
unknown. 

In  the  complexity  measures  validation  paradigm  that  was  labeled  typical, 
validation  of  complexity  measures  is  attempted  by  working  with  a  subset  A' 
of  the  domain  set  A  In  order  to  sidestep  some  of  the  problems  inherent  in 
working  with  a  subset  A'  of  A  Baker,  et.  al.,  suggest  avoiding  #' altogether 
and  working  with  a  relation  on  the  domain  A  which  is  weaker  [see  below  for 
a  definition  of  uieakzr  ordering],  but  more  practical,  than  the  elusive  preorder  on 
A.  Thus,  to  validate  a  software  complexity  measure  they  try  to  show  that  the 
measure  is  a  monotone  function  from  the  more  weakly  (pre)ordered  set  to  Q. 
The  particular  ordering  considered  by  Baker,  et.  al.,  is  called  containment  ordering, 
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B.2.  WEAKER  ORDERING: 

An  ordering  ^.is  weakfr  than  an  ordering  Tif  ^.is  a  subset  of  T  i.e., 
whenever,  two  elements  are  comparable  in  %,,  they  are  also  comparable-with 
the  same  relative  positions-in  T  This  can  also  be  written  as  follows:  an 
ordering  ^.is  weaker  than  an  ordering  Tif  for  all  ^  y,  if  fcyje  ^then  fcy)£ 
t 

B.3.  CONTAINMENT  ORDERING: 

Let  A  be  a  set  of  versions  of  a  program  and  let  a  e  A  and  b  e  A.  The 
versions  a  and  £  are  related  by  containment  ordering,  i.e.,  a  <t  £  if  and  only  if  a  is 
contained  in  £.  Informally,  this  can  be  taken  to  mean  that  a  can  be  found 
within  i.  For  example,  a<,  6  may  mean  that  every  statement  that  is  in  a  is 
also  in  i  or  that  the  flowgraph  for  a  is  a  subgraph  of  the  flowgraph  for  B. 

Note  that  the  relation  <  is  a  partial  order  on  the  set  A  since  a  <  i 
satisfies  reflexivity,  antisymmetry,  and  transitivity  properties  for  all  at  A  and 
bz  A.  Hence,  <  is  also  a  preorder  on  A.  Note  also  that  containment  ordering 
leads  to  a  family  of  relations  since  containment  can  be  defined  in  different 
ways.  It  is  the  authors'  contention  that  the  following  two  statements  are 
true: 

1.  every  software  complexity  measure  on  A  should  be  monotone  from 
A  preordered  by  <„  to  £2,  and 
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2.         <t  is  nontrivial,  i.e.,  there  is  enough  structure  given  A  by  <,  that 
requiring  a  measure  to  be  monotone  with  respect  to  S,  does  indeed 
say  a  lot  about  the  measure. 
B.4.  VALIDATION  OF  A  SOFTWARE  COMPLEXITY  MEASURE: 

If  a  e  A  and  i  e  A  [say,  two  versions  of  a  program]  and  if  a  £,  6  by  some 
well-defined  sense  of  containment,  then,  in  general,  any  complexity  inherent 
in  o  is  also  inherent  in  6.  Therefore,  for  any  complexity  measure  m  defined  on 
A,  what  is  desired  is  that  m(a)  <  m(6).  It  is  possible  that  m(a)  »  m(6)  or  that  a  and 
6  are  not  comparable  at  all  with  respect  to  <„•  but  if  a  <,,  S,  then  m(a)  <,  m(6)  must 
hold. 

The  above  result  leads  to  the  following  validation  procedure:  suppose 
that  validation  of  a  new  software  complexity  measure  m  is  desired.  Assume 
m  is  intended  to  measure  a  particular  component  of  complexity  in  the  elements 
of  A.  As  indicated  earlier,  it  is  not  possible  to  determine  the  true  associated 
complexity  preordering  on  A,  but  it  is  possible  to  verify  that  m  is  monotone 
with  respect  to  <„  for  one  or  more  definitions  of  containment. 

Baker,  et.  al.,  argue  that  m  being  monotone  with  respect  to  <  really  says 
a  lot  about  m  with  respect  to  the  elusive  preordering.  One  could  easily  verify 
that  m  can  distinguish  between  the  lack  of  this  component  of  complexity  and 
a  small  amount  of  it.  Let  a  e  A  be  a  simple  version  with  none  of  the 
component  of  complexity.  If  o  £,  6,  then  6  must  have  at  least  as  much  of  the 
component  of  complexity  as  a.  In  such  a  case,  the  measure  m  must  be  such 
that  m(a)  <  m(6).   This  sort  of  comparison  can  be  performed  on  every  pair  of 
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versions  which  are  comparable  under  <,,.    Whenever  a  <,  i,  the  measure  m  must 
rank  a  and  6  such  that  m(a)  S  m(6). 

Note,  however,  that  if  a  and  6  are  not  comparable  under  <0  then  verifying 
that  a  measure  m  is  monotone  with  respect  to  <.,  does  not  say  anything  about 
how  m  will  work  on  u  and  6.  Showing  that  m  is  monotone  with  respect  to  2, 
only  shows  that  m  is  monotone  with  respect  to  an  ordering  which  is  weaker 
than  the  elusive  preordering  and  the  weaker  ordering  is  not  trivial.  Thus 
showing  mis  monotone  with  respect  to  <  can  be  thought  of  as  a  good  "disaster 
check"  for  m  and  we  can  have  confidence  that  m  will  do  a  reasonably  good  job 
of  ordering  a  and  B. 
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Complexity  associated  with  a  software  program  is  made  up  of  two 
components,  namely,  source  code  complexity  and  data  structure  complexity. 
There  is  a  multitude  of  complexity  measures  that  try  to  assess  these  two 
components.  This  report  enumerates  a  source  code  complexity  measure 
called  characteristic  polynomial  and  a  data  structure  complexity  measure  called 
isai's  polynomials.  The  method  of  computation  of  each  measure  is  detailed. 

Characteristic  polynomials  are  computed  for  the  structured  flowgraph 
representations  of  programs.  A  new  relation  is  defined  on  structured 
flowgraphs  in  terms  of  structured  flawgrapk  transformations.  This  relation  on  the 
structured  flowgraphs  is  shown  to  be  a  partial  ordering  but  not  an 
equivalence  ordering.  This  argument  is  extended  to  show  a  partial  ordering 
associated  with  characteristic  polynomials. 

On  the  other  hand,  Tsai's  polynomials  are  computed  for  directed 
multigraph  representations  of  data  structures.  A  set  of  transformations 
called  directed  multigrapfi.  transformations  is  devised  and  a  new  relation  is  defined 
on  directed  multdgraphs  in  terms  of  directed  multigraph  transformations. 
This  new  relation  on  directed  multigraphs  is  again  demonstrated  to  be  a 
partial  ordering  but  not  an  equivalence  ordering.  Further,  Tsai's 
polynomials  are  shown  to  exhibit  a  partial  ordering. 

A  measure  that  maps  these  two  polynomials  to  reals  is  devised.  This 
mapping  measure  facilitates  direct  comparison  of  a  set  of  characteristic 
polynomials  or  Tsai's  polynomials.  Some  issues  involved  in  obtaining  a 
hybrid  complexity  measure  by  combining  source  code  complexity  and  data 
structure  complexity  is  stated. 


